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ABSTRACT 

Customers  served  by  an  M/M/1  queueing  system  each  receive  a  reward  R 
but  pay  a  holding  cost  of  C  per  unit  time  (including  service  time)  spent  in  the 
system.  The  decision  of  whether  or  not  a  customer  joins  the  queue  can  be 
made  on  an  individual  basis  or  a  social  basis.  The  effect  of  increasing  the  ar¬ 
rival  rate  on  the  optimal  policy  parameters  is  examined.  Some  limiting  results 
are  also  derived. 
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INTRODUCTION 

The  control  of  entry  of  customers  to  a  queue  is  a  virtual  necessity  in  some  cases  and 
highly  desirable  in  others.  Rosenshine  16]  presents  a  queue  with  state  dependent  service  times 
for  which  no  steady  state  distribution  exists  for  any  arrival  rate  and  for  which  control  of  entry 
must  eventually  be  exercised  in  the  presence  of  continuing  arrivals.  Even  queues  for  which 
steady  state  distributions  exist  can  often  be  operated  more  effectively  by  controlling  entry  of 
customers,  particularly  those  queues  in  which  customers  have  a  high  expected  waiting  time. 

Prior  to  the  appearance  of  Naor’s  work  15]  which  dealt  with  optimal  control  of  entry  to  a 
single-server  queue,  control  was  viewed  primaril’  »«  method  of  insuring  adherence  to  some 
arbitrary  externally  imposed  constraint,  e.g.,  litm  .  t  queue  size  or  expected  waiting  time, 
without  regard  for  economic  or  efficient  operation  u.  j  system.  Yechiali  [9]  dealt  with  the 
same  problem  as  Naor  in  a  different,  but  equivalent,  manner  and  provided  what  appears  to  be  a 
more  generally  applicable  procedure  for  determining  the  optimal  control  policy  and  the  gain  that 
can  be  achieved  through  its  employment. 

After  the  appearance  of  these  early  papers,  work  dealing  with  the  application  of  optimal 
control  of  arrivals  to  more  general  queueing  systems  began  to  appear.  Knudsen  [2]  and  Yechi¬ 
ali  (10)  extended  the  optimal  control  solution  to  s  servers,  the  Knudsen  work  considering  a 
slightly  more  general  cost  structure  than  heretofore  examined.  A  survey  by  Stidham  and  Pra- 
bhu  [7]  of  the  control  of  queueing  systems  work  also  pointed  out  that  many  of  the  papers  sur¬ 
veyed  had  common  considerations.  Subsequently,  Knudsen  and  Stidham  [3]  considered  the 
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case  for  which  a  new  customer's  net  benefit  for  joining  was  a  decreasing  function  of  the 
number  of  customers  already  present  in  the  system  when  the  new  customer  arrived.  Lippman 
and  Stidham  [4]  examined  the  optimal  control  of  a  queue  with  state-dependent  service  rate, 
increasing  with  the  number  of  customers  in  the  system,  and  random  rewards.  They  showed 
that  regardless  of  the  discounting  policy  and  time  horizon,  the  individual  optimum  policy 
admits  customers  to  the  system  whenever  the  social  optimum  policy  does  (and  perhaps  other 
times  as  well).  Stidham  [8]  extended  this  work  to  cover  a  GI/M/1  system.  Recently,  Johansen 
and  Stidham  [1|  have  shown  that  many  of  the  properties  of  and  relationships  between  the  indi¬ 
vidual  and  social  optimal  control  policies  apply  under  very  general  conditions,  e.g.,  dependent, 
nonidentically  distributed,  batch  arrivals. 

In  this  paper,  we  return  to  Naor's  model  and  examine  the  sensitivity  of  its  solution  to 
changes  in  the  arrival  rate,  A .  We  do  this  for  two  cases,  the  individual  optimum  and  the  social 
optimum.  In  the  former  case,  an  arriving  customer  acts  so  as  to  maximize  his  own  gain.  In  the 
latter,  a  customer’s  decision  to  join  the  queue  or  not  is  based  on  the  effect  of  the  decision  on 
the  gain  rate  (benefit  or  profit  per  unit  time)  of  all  customers  wishing  to  use  the  queueing  sys¬ 
tem.  For  the  individual  optimum  case,  the  gain  rate  is  shown  to  rise  as  A  increases  and  then 
fall  because  the  control  limit  (maximum  queue  system  size  at  which  a  customer  will  choose  to 
join  the  queue)  remains  constant  as  A  changes.  In  the  social  optimum  case,  the  maximum  gain 
rate  obtainable  is  proven  to  be  a  nondecreasing  function  of  A  whereas  the  control  limit  which 
yields  the  maximum  gain  rate  proves  to  be  a  nonincreasing  function  of  A .  Thus,  in  all  practical 
applications,  as  the  arrival  rate  rises,  it  is  possible  to  increase  the  gain  rate  at  the  expense  of 
level  of  service  (fraction  of  arrivals  allowed  to  enter  the  queue). 

A  potential  application  of  the  model  considered  lies  in  the  control  of  the  arrival  of  aircraft 
to  an  airport.  The  control  limit  determined  can  be  used  in  connection  with  an  examination  of 
actual  traffic  during  busy  periods  to  see  when  the  traffic  exceeds  the  control  limit.  The 
rescheduling  of  this  excess  traffic,  delaying  it  prior  to  take-off,  or  the  scheduling  of  this  traffic 
into  alternative  terminals  represent  actions  which  will  increase  the  overall  gain  rate  for  airline 
operations.  At  the  same  time,  fuel  expended  while  waiting  to  land  will  be  reduced. 

THE  MODEL  AND  ITS  SOLUTION 

The  model  is  an  M/M/1  queueing  model  with  arrivals  occurring  at  a  rate  A  and  having  a 
service  rate  capability  of  /i.  A  cost  structure  is  imposed  on  the  operation  of  the  system  in 
which 

a)  each  customer  served  receives  a  reward  of  R  dollars,  and 

b)  each  unit  of  time  a  customer  spends  in  the  system  costs  him  C  dollars. 

Each  arriving  customer  is  given  the  choice  of  joining  the  queue  and  receiving  reward  R  and 
paying  C  per  unit  time  in  the  system  or  of  not  joining  and  not  paying  or  receiving  any  money. 
Customers  are  assumed  to  decide  by  comparing  the  expected  net  gain  associated  with  each  deci¬ 
sion  and  choosing  the  action  with  the  larger  gain.  (In  case  of  a  tie,  the  customer  joins  the 
queue.)  This  is  the  model  considered  by  Naor  [5]  and  Yechiali  [9],  although  Yechiali  allowed  a 
general  arrival  distribution  and  a  slightly  more  general  cost  structure. 

Naor  argues  that  all  reasonable  strategies  lead  to  a  finite  capacity  queue.  He  determines 
n„  the  capacity  under  self-optimization  where  each  customer  considers  only  his  own  expected 
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net  gain  in  deciding  whether  or  not  to  join  the  queue.  The  expected  net  gain  for  joining  is 
R  —  (i  +  1)  C/m,  where  /  is  the  number  of  customers  the  arrival  finds  in  the  system.  Joining 
the  queue  thus  serves  the  self-interest  of  a  customer  if  /  is  less  than  ns,  where  ns  is  defined  by 

/?-(»,  +  1)C/m  <  0  <  'R  -  ns  C/fi. 

This  strategy  leads  to  an  M/M/1  /ns  queueing  system  for  which 

(1)  ns  -  [*m/C] 

where  [  ]  indicates  "the  greatest  integer  in." 

If  each  customer  or  an  administrator  acts  to  maximize  the  sum  of  the  individual  net 
benefits,  the  problem  becomes  the  social  optimum  problem.  Considering  an  infinite  horizon 
without  discounting,  Naor  sets  up  the  following  expected  overall  net  benefit  rate  function: 

(2)  g{n)-k'(n)R  -  C  Lin) 

where  A '(«)  and  Lin)  are,  respectively,  the  effective  arrival  rate  to  and  the  expected  number 
in  the  system  when  a  maximum  of  n  customers  is  allowed  in  the  system,  gin)  is  the  expected 
net  benefit  rate  or  expected  gain  per  unit  time  when  a  maximum  of  n  customers  is  allowed  in 
the  system.  The  units  of  gin )  are  dollars  per  unit  time. 

Another  form  of  (2)  will  be  used  in  the  remainder  of  this  paper.  This  form  is  derived  by 
noting  first  that 

A  'in)  -  A(1  -  9„in))  -  A  £  #,(») 

i-O 

where  9,in)  is  the  stationary  probability  that  there  are  /customers  in  the  system  when  the  balk¬ 
ing  point  is  n  and  second  that 

Lin)  -  Yj9j(n)-±-  £  j  --£</  +  1)  9,in). 

y-i  /*  y-i  **  (-0 

Thus,  (2)  can  be  written  as 

(3)  gin)  -  £  «,<«)  X  (J8  —  </  +  1)C/m). 

1-0 

Since  the  9,in)  represent  the  steady-state  probabilities  of  a  semi-Markov  process,  this  equation 
represents  the  gain  function  for  a  semi-Markov  decision  process  in  which  n  must  be  decided. 
The  optimum  value  of  n  can  be  determined  by  search  or  more  conveniently  by  policy  iteration. 
A  derivation  of  this  equation  directly  from  the  semi-Markov  decision  process  formulation  of 
this  problem  is  found  in  (9]. 

SOME  PROPERTIES  OF  OPTIMAL  CONTROL  POLICIES 

The  gain  rate  and  control  limit  for  the  individual  optimum  and  social  optimum  cases  can 
be  determined  from  Naor’s  results  for  a  given  value  of  A.  However,  the  arrival  rate  A  is  subject 
to  change.  For  example,  A  grows  as  the  usage  of  an  airport  grows.  Thus,  it  is  useful  to  know 
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how  the  optimal  control  policy  should  be  changed  in  response  changes  in  system  usage.  It  is 
these  changes  that  we  examine  here. 

An  increase  in  A ,  the  arrival  rate,  has  no  effect  on  the  solution  to  the  individual  optimum 
problem  because  a  self-optimizing  customer  considers  only  his  own  expected  net  benefit  in 
deciding  whether  or  not  to  join  the  system.  The  balking  point,  ns,  is  independent  of  A  as  indi¬ 
cated  by  (1).  Self-optimizing  customers  do  not  recognize  any  measure  of  the  overall  gain  of  all 
arrivals.  They  only  consider  their  own  expected  gain  which  does  not  depend  on  A .  However, 
for  any  policy  adopted  by  self-optimizing  customers,  the  gain  per  unit  time  can  be  calculated. 
Sample  results  are  shown  in  Table  1.  Comment  on  these  results  is  reserved  until  comparable 
social  optimum  results  are  presented. 


TABLE  1  —  Gain  Rates  Implied 
by  Individual  Optimum  for 
Various  Values  of  A 
(R  -  5,  C  =  2,  n  =  3) 


A 

g 

0.1 

7 

.431 

1.0 

7 

4.001 

2.1 

7 

6.537 

2.2 

7 

6.595 

2.3 

7 

6.623 

2.4 

7 

6.621 

4.02 

7 

4.637 

4.05 

7 

4.596 

5.0 

7 

3.556 

16.4 

7 

1.448 

16.6 

7 

1.441 

100.0 

7 

1.062 

The  solution  of  the  social  optimum  problem  is  affected  by  an  increase  in  A.  The  effect  of 
A  on  the  expected  gain  rate  of  the  system  is  considered  first. 

THEOREM  1:  g  is  a  nondecreasing  function  of  A. 

PROOF:  Define  g(Pt,  A)  to  be  the  expected  gain  rate  of  the  system  under  policy  P,  when 
the  arrival  rate  is  A.  Let  g*(A)  denote  the  maximum  expected  gain  rate  when  the  arrival  rate  is 
A.  Let  0,(1)  be  the  stationary  probability  that  an  arrival  is  allowed  to  join  when  the  system 
contains  /  customers  and,  similarly,  let  0,(0)  -  1  -  0,(1)  be  the  stationary  probability  that  an 
arrival  must  balk  when  /  are  in  the  system.  Yechiali  (91  shows  that  a  deterministic  control-limit 
policy  is  optimal  so  let  P\  -  {0,(1):  0,(1)  -  1,  /  <  «oi  0/(0  "0,  /'  ;>  n0)  be  such  that 

g*(A  ")  -  g(P\,  A").  Let  A’  >  A  "and  P2  -  (0,(1):  0,(1)  -  /  <  n0;  A< 0  -  0,  /  £  n0K 

where  n0  is  the  balking  point  from  policy  P\.  Under  policy  P2,  an  arrival  who  finds  fewer  than 
n0  customers  in  the  system  is  allowed  to  join  with  probability  A  '/A '  and  is  forced  to  balk  with 
probability  l-A'/A'.  Since  no  penalty  is  assessed  for  rejecting  a  customer,  g(P2,  A')  - 
g(P\,  A')  -  g* (A ') .  Finally,  since  g*(A’)  £  g(P2,  A'),  g*( A')  £  g*(A'). 
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Although  the  social  optimum  balking  point  can  be  determined  for  a  given  A  by  policy 
iteration,  the  range  of  A  for  which  the  balking  point  is  a  given  integer  can  also  be  found  as  fol¬ 
lows.  Define  {/(/)}  to  be  the  sequence  of  expected  rewards  for  joining,  where 
/(/)  -  R  -  Cii  +  1  )/p  is  the  expected  reward  if  i  customers  are  in  the  system.  Then,  from 

(3) , 

(4)  gin)  -  £  ®j(n)A/(/). 

i-O 

Naor  states  that  gin)  is  discretely  unimodal  in  n.  Thus,  n0  is  such  that  Ag(n0  +  1)  < 
0  <  Agin0),  where  Agin)  -  gin)  -  gin  -  1). 

Ag(n)  -  £  Qjinikfii)  -  £  9tin  -  1)A /(/) 

i-0  1-0 

-  £  (0,(«)  -  9,in  -  1))  A /(/)  +  0„_|(n)  A /(n  -  1). 

1-0 


Agin)  >  0  if 


9„^in)\fin  -  1)  >  £  (0,(n  -  1)  -  0,(n))X/(f). 
1-0 


If  p  -  A/p  1 ,  then  (5)  can  be  written  as 

(6)  (l-p)p"-'/(n-l)/0-p-+,)>  I 

It 0  (1  -  p")  (1  -  p"  ) 

After  formation  of  a  common  denominator  on  the  right-hand  side  and  division  by 
1(1  -  p)/(l  -  p"+1)},  (6)  becomes 

An  -  1)  >  £  p'/(«). 

(1  -  P>  ifo 


Substitution  of  £  p'  for  (1  -  p")/(l  -  p)  leads  to  Agin)  >  0  if 
1-0 

(7)  /(/i  -  1)  £  p'  >  p  £  p'fii),  n  >  2. 

1-0  1-0 

If  p  -  1,  <P,(n)  «  — j-r,  so  (5)  becomes 

W  T  1 

/(«  -  l)/(i»  +  1)  £  £  1  -  ~ T  /(,) 
if?  |  »  «  +  1 


»/(»  -  1)  £  ^  /(/). 


Since  the  limit  as  p  —  1  of  (7)  is  (8),  (7)  can  be  used  for  ail  values  of  p.  Thus,  for  a  given  p, 
no  is  the  largest  value  of  n  such  that  (7)  holds. 


If  p  is  held  constant,  (7)  can  be  used  to  find  the  range  of  A  for  which  n0  is  a  given 
integer.  (Note  Agin)  <  0  if  £  is  changed  to  <  in  (7).)  Finding  the  range  of  A  for  which 
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w0-  1  requires  finding  X  such  that  Ag(l)  >  0,  but  Ag(2)  <  0.  If  Ag(l)  <  0,  R  <  C/n  and 
the  system  is  trivial  since  customers  never  enter  it.  Thus,  Ag  (1)  >  0  for  all  X  since  giO)  =  0. 
Finding  the  range  of  X  over  which  n0  =  j,  j  ^  2,  requires  finding  X  such  that  Agij)  >  0  and 
Agij  +  1)  <  0.  Note  that  solving  (7)  for  X  such  that  Agin)  <  0  or  Agin)  >  0  involves 
finding  the  roots  of  the  n  -  1  degree  polynomial  resulting  from  Agin)  -  0.  Use  of  (7)  is 
demonstrated  by  an  example. 


Suppose  that  the  reward  for  service  is  R  —  5,  the  cost  per  unit  time  in  the  system  is 
C  -  2  and  the  service  rate  of  the  single  server  is  p  =  3.  {/(/))  —  {/?  -  Cii  +  l)/p)  “  { 1 3/3, 
11/3,  3,  7/3,  5/3,  1,  1/3,  —1/3,  . ..).  (Note  that  n5  =  7.)  First,  find  the  range  of  X  for  which 
n0  =  1.  Ag(l)  >  0  for  all  X.  A/? (2)  <  0  if  from  (7) 

i  o 

/(l)  £p'  <  p  ]Tp'/(/)  or  16.5  <  X. 

i-0  /-0 


Thus,  n0  —  1  if  X  >  16.5.  Now  find  the  range  of  X  over  which  n0  =  2.  Agi2)  >  0  if 
X  <  16.5.  A#(3)  <  0  if  from  (7) 

2  1 

/( 2)  2>'  <  P  2> '/(/■)  or  — 2X^  —  I2X  +  81  <  0. 

1-0 

Solution  of  the  above  yields  Ag(3)  <  0  if  X  >  4.035.  Therefore,  n0  =  2  if  4.035  <  X  <  16.5. 
Further  use  of  (7)  yields  w0  =  3  if  2.1  <  X  <  4.035.  Table  2  presents  the  solution  found  using 
policy  iteration  for  several  values  of  X.  These  results  confirm  (7)  and  Theorem  1. 


TABLE  2  —  Policy  Iteration  Results  for 
Various  Values  of  X 
iR  -  5.  C  -  2,  M  =  3) 


X 

no 

g 

0.1 

7 

0.431 

1.0 

5 

4.003 

2.1 

4 

6.944 

2.2 

3 

7.128 

4.02 

3 

8.993 

4.05 

2 

9.011 

16.4 

2 

10.998 

16.6 

1 

11.010 

100.00 

1 

12.621 

Comparison  of  Tables  1  and  2  illustrates  that  no  <  n,  and  that  the  gain  rate  associated 
with  the  individual  optimum  policy  is  no  greater  than  the  social  optimum  gain  rate.  Also,  the 
gain  rate  associated  with  the  individual  optimum  policy  reaches  a  peak  and  then  declines  as  X 
increases.  On  the  other  hand,  the  social  optimum  gain  rate  is  nondecreasing  in  X  as  shown  in 
Theorem  1. 

The  value  of  n0,  the  forced  balking  point  for  the  social  optimum  problem,  appears  to  be 
nonincreasing  as  the  arrival  rate  increases  in  Table  2.  This  result  is  established  as  Theorem  2. 
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THEOREM  2:  n0  is  a  nonincreasing  function  of  X. 

PROOF:  R  and  p  are  assumed  to  be  finite  and  C  is  assumed  to  be  nonzero.  Since  Naor 
proved  that  g  is  discretely  unimodal  in  n,  nQ  is  optimal  if  A gin0  +  1)  <  0  <  A gin0).  Let 
n0i\)  denote  the  optimal  forced  balking  point  when  the  arrival  rate  is  X.  Let  n  -  /i0(X'), 
where  X  "  is  a  fixed  value  of  X  and  n  >2.  From  (7) 

A  gin)  >  0  if 

fin  -  1)  jr  p‘  >  £  p'/(/). 

/- o  i-O 

for  n  >  2,  which  can  be  written  as 
Agin)  >  0  if 

(8)  fin  -  1)  (1  +  p  +  . . .  +  p"~')>  p/(0)  +  p2f(l)  +  . . .  +  p"-'fin  -  1). 

As  X  increases,  p  increases.  Since  I/O))  decreases  as  i  increases,  /(0)  >  fi\)  >  ...  > 
fin  -  2)  >  fin  -  1).  Thus,  as  X  increases,  the  right-hand  side  of  (8)  increases  faster  than 
the  left-hand  side.  This  eventually  leads  to  Agin)  <  0  for  X  greater  than  or  equal  to  some 
X'  >  X".  Therefore,  n0(\’)  <  n0(\").  Since  A^d)  >  0  for  all  X,  continuation  of  the  above 
argument  leads  to  the  existence  of  X'"  such  that  n0(\)  -  1  for  all  X  >x"'.  Thus,  as  X 
increases,  n0  decreases  until  it  becomes  equal  to  one. 

Since  n0  has  been  shown  to  be  a  nonincreasing  function  of  X  and  g  has  been  shown  to  be 
a  nondecreasing  function  of  X,  the  question  of  existence  of  the  limits  as  X  —  °°  of  n0  and  g 
arises.  From  Theorem  2,  n0  decreases  as  X  increases  until  n0  =  1.  n0  remains  one  for  further 
increases  in  X .  Thus, 

lim  «o  =  1. 

x— oo 

If/»o-  1. 

90in0)  -  0O(1) - and  *,<«0)  =  0,(1) - . 

~r  K  fJ,  "r  A 

From  (4) 

*(1)«  £0,(1)X/(/)-mX/(O)/(m  +X). 

i-O 

Dividing  numerator  and  denominator  of  the  right-hand  side  by  X  and  letting  X  —*  °°,  yields 

(9)  lim  g(l)  -  pfiO). 

k—  «© 

For  the  example  given  previously  lim  g(l)  -  13. 

\—oo 

Equation  (9)  can  also  be  justified  intuitively.  If  no  queue  is  allowed  to  form,  customers 
can  only  join  if  the  system  is  empty.  The  expected  reward  for  a  joining  customer  is  /(0).  The 
expected  time  between  departures  of  customers  from  the  system  is  I/p  plus  the  expected  time 
between  the  completion  of  a  service  and  the  arrival  of  the  next  customer.  In  the  limit,  an 
arrival  occurs  at  the  instant  a  service  is  completed  so  the  system  is  never  empty.  Thus,  the 

expected  gain  rate  of  the  system  as  X  — '  <»  is  -  p  /(0). 

1/m 


532 


R.  C.  RUE  AND  M.  ROSENSHINE 


SUMMARY 

The  effect  of  an  increasing  arrival  rate  on  the  individually  and  socially  optimal  control  pol¬ 
icies  for  entry  to  an  M/M/1  queue  has  been  investigated.  The  results  presented  are  useful  in 
determining  the  sensitivity  of  the  optima!  control  policies  to  changes  in  the  customer  arrival 
rate. 
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ABSTRACT 

Suppose  one  object  is  hidden  in  the  A-th  of  n  boxes  with  probability  pikl 
We  know  the  probability  </(r.  k)  of  detecting  the  object  if  it  is  hidden  in  box  k 
and  wic  expend  effort  r  searching  box  k.  Our  aim  is  to  minimize  the  expected 
search  effort  of  a  successful  search  Previously  this  problem  has  been  solved 
only  under  the  assumption  that  the  functions  </(-.  k  I  are  concave  We  prove, 
without  concavity  assumptions,  the  existence  of  an  optimal  distribution  of 
search  effort  and  give  a  procedure  for  its  construction. 


INTRODUCTION 

An  absent-minded  person  repeatedly  loses  his  glasses  in  his  own  house.  In  order  to 
search  optimally  for  the  glasses  he  estimates  the  following  data:  p(k)  for  k  €  B  = 

(1 . n ) ,  the  a-priori  probability  that  he  has  lost  his  glasses  in  the  A-th  room,  and  qit.  k) 

for  t  €  Rq  and  k  6  B,  the  conditional  probability  that  he  finds  ahe  glasses  if  he  has  lost  them 
in  room  k  and  he  searches  this  room  for  a  time  period  of  length  t.  We  assume  that  q(\  k)  is  a 
probability  distribution  function,  i.e.,  it  is  increasing  and  right-continuous. 

More  important  research  areas  where  we  can  apply  this  problem  are  the  following:  the 
search  for  a  defect  in  a  large  system,  the  search  for  oil  or  a  mineral  in  different  regions  and  the 
search  for  a  sunken  ship  where  we  partition  the  ocean  into  cells.  In  the  following  we  discuss 
the  search  for  an  object  hidden  in  one  of  a  finite  number,  n,  of  boxes. 

Now  we  have  to  decide  how  we  should  search  for  the  hidden  object.  A  search  procedure 
(fj,  A',)  for  /  €  IN,  i,  €  R+  and  k,  €  B  prescribes  the  following  behaviour:  search  box  k{  for  /, 
hours,  then  search  box  k2  for  i2  hours  and  so  on  until  the  object  has  been  found. 

These  search  strategies  are  appropriate  if  only  one  person  searches  for  the  object.  But 
while  searching  for  a  sunken  ship  there  may  be  many  boats  or  planes.  In  this  case,  we  can 
specify  that  one  half  of  them  searches  cell  l,  another  third  searches  cell  5  and  so  on.  We 
assume  that  the  number  of  boats  and  planes  is  so  targe  that  it  is  a  reasonable  approximation  to 
assume  that  we  can  divide  the  search  effort  in  an  arbitrary  fashion. 
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A  strategy  is  a  function  s  :R0+  x  B  —  R<f  where  sit.  k)  specifies  how  much  of  the  search 
effort  t  has  to  be  spent  in  box  k.  Therefore,  £  s(t.  k)  =  t  and  s(\  k)  is  increasing.  By 

kiB 

Qis,  t )  we  denote  the  probability  that  we  find  the  object  when  we  spend  search  effort  r.  Obvi¬ 
ously, 

()(s.  /)  ”  £  p(k)q(s(t,  k),  k) 

MB 

and 


Q(s,  °°)  =  1  -  lim  Q(s.  t) 

l—oo 

is  the  probability  that  we  never  find  the  object  if  we  use  s.  The  function  Q(s.  •)  is  the  probabil¬ 
ity  distribution  function  of  the  random  variable  Y'(s)  which  measures  the  search  effort  which  is 
necessary  to  find  the  object  by  strategy  s.  The  mean  of  K(s)  is  the  expected  search  effort  of  s 
and  will  be  denoted  by  E(s).  It  is  a  well  known  result  of  probability  theory  that 

E(s)~  f  t  dQ  (s,  •)  =  /  (1  -  Q(s,i))dt. 

10.«>1  lO.oo) 

A  strategy  s  is  called  optimal  iff  its  expected  search  effort  is  minimal. 

Gilbert  [3]  and  Kisi  [5]  determined  optimal  strategies  for  some  definite  concave  functions, 
q(\  k).  Onaga  [6]  gave  a  solution  of  the  problem  under  the  assumption  that  all  q(-.  k)  are 
concave.  As  we  shall  see  in  this  paper  the  assumption  of  Onaga  is  a  considerable  simplification 
of  the  problem.  Here  we  have  to  mention  that  these  three  authors  tried  to  solve  a  more  gen¬ 
eral  and  much  more  difficult  search  problem.  They  assumed  that  one  has  to  pay  some  extra 
costs,  called  switch  costs,  if  one  changes  the  place  of  search.  Stone  [7]  proved  the  results  of 
Onaga  using  new  methods  which  we  present  in  Section  2. 

In  Section  1  we  give  a  necessary  and  sufficient  condition  for  the  existence  of  a  strategy 
with  finite  expected  search  effort.  In  Sections  2  and  3  we  repeat  briefly  the  solution  of  the 
problem  under  the  assumption  that  the  functions  <?(-,  k)  are  concave.  Later,  we  shall  use  these 
results  for  the  solution  of  the  general  problem. 

In  Section  2  we  assume  that  our  funds  are  limited  by  T  €  R0+.  The  best  we  can  do  is  to 
distribute  this  search  effort  in  such  a  way  that  the  probability  of  finding  the  object  is  maximal. 
A  F-admissible  allocation  is  a  function  a  :  B  —  R0+  where  £  a(k)“  T.  The  allocation  a 

kiB 

prescribes  that  we  spend  a(k)  for  the  search  of  box  k.  The  probability  of  success  of  a  becomes 
P(a)—  £  p(k)q(a(k).k). 

kiB 

We  construct  a  T-optimal  allocation.  In  Section  3  we  use  this  result  and  construct  an  optimal 
strategy. 

In  the  following  sections  we  do  not  assume  that  q(\  k)  is  concave.  Our  main  result  is 
the  following.  Let  qi\  k)  be  the  smallest  concave  function  nowhere  smaller  than  q(\  k).  The 
optimal  strategy  of  Section  3  for  the  search  problem  where  q(\  k)  is  replaced  by  ?(•.  k),  is 
also  optimal  for  the  problem  with  the  functions  q(\  k).  In  Section  4  we  show  how  we  may 
improve  strategies  and  in  Section  5  we  prove  the  main  result. 
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Finally  (Section  6)  we  compare  our  results  with  two  similar  search  problems.  First  is  the 
well-known  problem  where  one  is  not  allowed  to  divide  the  search  effort  in  an  arbitrary  fashion, 
but  where  the  cost  of  the  >th  search  of  box  k  and  the  probability  of  overlooking  the  object  dur¬ 
ing  this  search  (even  if  one  searches  the  right  box!  is  given.  Second  we  consider  the  search 
problem  where  the  hidden  objec.  is  a  point  of  R". 

1.  THE  EXISTENCE  OF  A  STRATEGY  WITH  FINITE  EXPECTED  SEARCH  EFFORT 

Without  loss  of  generality,  we  assume  that  p(k)  >  0  for  ali  k.  Let  £(s|k)  be  the 
expected  search  effort  of  strategy  s  if  the  object  is  hidden  in  box  k.  Since 

£(s)  -  £  p(k)E(s  |k), 

ktB 

E(s)  is  finite  iff  E(s\k)  is  finite  for  all  k.  If  the  object  is  hidden  in  box  k  the  best  we  can  do  is 
to  search  only  box  k.  That  means  s*  defined  by  sk(t,  k)  -  i  and  sk(r.  k')  -  0  for  k'  ^  k  is 
optimal  in  that  case.  Therefore, 

£(s|*)  >  £(s4|*>-  /  (I  -<?(/.  k))dt 
|0.  <»  ) 

and  the  strategy  s  may  have  finite  expected  search  effort  only  if  £(i*|A)  is  finite  for  all  k. 

THEOREM  1:  There  exists  a  strategy  with  finite  expected  search  effort  iff 
J  ( 1  -  q  (r,  k  ))dt  <  °°  for  all  k. 

[0.  oo  ) 


PROOF:  We  have  already  proved  the  only-if  part.  Now  we  may  assume  that 

£(s*|k)  —  f  (I  -  q(t.  k))dt  <  oo 
|0  «• ) 

for  all  k.  Let  s  defined  by  s(t,k)  -  tin  be  the  strategy  which  distributes  the  search  effort 
equally  to  all  boxes.  Obviously, 

£(s|k)  -  n£(s*|k)  <  oo 


and 


£(s)-  £  p(*)£(j|A)  <  oo. 
kiB 

In  the  following  we  assume  the  existence  of  a  strategy  with  finite  expected  search  effort. 

2.  OPTIMAL  ALLOCATIONS  FOR  THE  CONCAVE  CASE 

In  this  chapter  we  assume  that  the  functions  q(\  k)  are  concave.  Charnes  and  Cooper 
[2]  solved  this  problem  for  some  definite  concave  functions  q(r.  k).  A  general  solution  has 
been  obtained  by  Stone  [7],  We  cite  his  results  and  then  we  choose  a  special  r-optima!  alloca¬ 
tion  which  will  be  useful  for  our  considerations  for  nonconcave  functions,  q  (\  k ). 

Stone  used  optimization  techniques  involving  Lagrange  multipliers.  These  basic  tech¬ 
niques  for  the  theory  of  search  have  been  presented  by  Wagner  and  Stone  181.  We  need  the 
following  function  /  defined  by 
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l(k,  X,  /) :-  p(k)q(t,  k)  -  X/  for  k  €  fl,  X  €  Rq  and  /  €  Ro 

which  is  called  the  pointwise  Lagrangian.  Stone  (7]  has  shown  that  a  /-admissible  allocation  a, 
is  /-optimal  if  for  some  X  >  0 

l(k.  X .  a,(k))  -  max{/(X,  X ,  z)|z  6  Rq  )  for  all  k  €  B. 

The  problem  of  maximizing  the  pointwise  Lagrangian  is  a  problem  of  calculus.  The  following 
lemma  states  some  easy  facts. 

LEMMA  1:  Let  q(\  k)  be  a  concave  probability  distribution  function  on  Rq  .  q'(t.  k) 
lim  h~l(q(t  +  h.k)  -  qit.k ))  is  well  defined  and  nonnegative  for  /  >  0.  It  may  happen 

A— 0.A>0 

that  q'(0.  k)  -  °o.  q'(\  k)  is  decreasing  and  right-continuous.  'q(t,  k) lim  h~Hq (/  + 

A-O.ACO 

h.  k)  -  q(t.  k))  is  well  defined  and  nonnegative  for  /  >  0.  'q(\  k)  is  decreasing  and  left- 
continuous. 

q'(t.  k)  <  'q(t.  k)  for  all  /  and  k. 

By  these  properties  it  is  easy  to  see  that  a,  maximizes  the  pointwise  Lagrangian  for  some  X  >  0 
if 

p(k)q'(a,(k),  k)  <  X  and  p(k)’q(a,(k).  k)  >  X  if  a,(X)  >  0 

and 

p(k)q'(a,(k) .  k)  <  X  if  a,(k)  -  0. 

We  now  choose  for  each  /  >  0  a  definite  /-optimal  allocation  in  the  following  way.  Let 
T*( L ,  k)  sup(/  €  R0+  \p(k)q'U.  k )  >  L) 

,and 

7*  ( L .  k)  :  -  sup(/  €  Ro  \p(k)q'(t,  k)  >  L) 
thi  L  €  Ro  and  *  €  B. 

Obviously,  T*  .  k )  -  7*  («® ,  k )  -  0  and  7*  (0.  X )  -  <» .  For  each  /  6  Ro  we  may 
define  an  allocation  a*  with  the  following  properties:  a*  is  /-admissible;  there  is  an  LU)  €  R0+ 
and  a  box  jit)  so  that  af(k)  -  T*(L(t).  k)  for  k  <  ./'(/),  a?(X)  -  T*  (.L(t),  k)  for  k  >  j(t) 
and  ari/U))  €  I T*(Ut).k).  T*  (LU).  k)l 

Again,  by  lemma  1  and  our  results  above,  a*x  maximizes  the  pointwise  Lagrangian  for 
X  -  Hi)  and  is  therefore  /-optimal. 

THEOREM  2:  a*  is  an  optimal  allocation  of  the  search  effort  /. 

3.  OPTIMAL  STRATEGIES  FOR  THE  CONCAVE  CASE 

By  the  definition  of  a,*  we  may  conclude  that  af(k)  is  increasing  as  a  function  of  /.  Thus, 
t*.  defined  by  **(/.  k)  :  -  a,*(X),  is  a  strategy.  Again,  by  the  results  of  Stone  [7],  s*  is  optimal. 
This  may  be  seen  directly,  also.  Let  s  be  a  strategy.  Then  a,  defined  by  a,(X)  :  -  sU,  k)  is  a 
/•admissible  allocation.  By  theorem  2 
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£(s*)-  /  (1  -Q(s\t))di-  f  (1  —  P(a*))dl 

|0.  <»> 

<  J  <1  -P(a,))dt~  /  (1  -  Q(s.  i))dt  -  £(s 

It).  oo> 


). 


THEOREM  3.  s*  is  an  optimal  strategy. 

4.  AN  IMPROVEMENT  RULE 

For  each  search  problem  which  is  defined  by  n.  p  and  q  we  define  the  dual  problem  by  n,  p 
and  q  where  n  n,p:  —  p  and  q{‘.  k )  is  the  smallest  concave  function  nowhere  smaller  than 
</(•.  k).  By  elementary  analysis  one  can  prove  that  for  each  ?(•,  k)  there  is  a  (perhaps  empty) 
set  of  disjoint  intervals  (fj.  1 2)  so  that  outside  of  these  intervals  q(\  k)  and  q(\  k)  coincide, 
while  q(\  k)  is  on  each  interval  ( t\ ,  t2)  a  linearly  and  strictly  increasing  function  larger  than 
</(\  k).  We  denote  these  intervals  by  (/t.  i2.  k)  to  mark  the  appertaining  box.  The  set  of  all 
these  triples  will  be  called  /. 

Let  us  investigate  the  optimal  strategy  s *  for  the  dual  problem  which  we  have  constructed 
in  Sections  2  and  3.  The  search  effort  for  box  k  increases  from  f,  to  r2  within  a  time  interval  of 
length  t2  —  t,  if  (t,.  t2.  k)  €  /.  That  means  s*  prescribes  that  we  have  to  search  for  a  time 
period  of  length  r2  —  t\  only  for  box  k  and  the  search  effort  for  k  increases  during  this  time 
period  from  f|  to  r2.  This  will  be  called  the  (r,.  t2.  k)  -  property. 

In  the  following  we  shall  see  that  it  would  not  be  sufficient  to  use  an  arbitrary  optimal 
strategy  for  the  dual  problem  instead  of  an  optimal  strategy  fulfilling  the  (r , .  r2.  k)  -  property 
for  each  (r,  r2,  k)  €  I.  We  explain  this  fact  in  the  following  way.  If  we  increase  the  search 
effort  for  the  box  k  from  / 1  to  /2,  then  our  profit  is  increased  by  p(,k)q(t2  k)~  p(k)q(ii,  k) 
while  our  search  effort  increases  by  t2  -  t\.  The  quotient  of  profit  and  cost  may  be  called 
efficiency.  If  (r j ,  r2.  k)  6  /  then  it  is  less  efficient  to  increase  the  search  effort  for  box  k  from 
r,  to  t\  +  h  (for  h  <  t2  —  /t)  than  to  increase  it  from  t\  +  h  to  t2.  Then,  using  the  ideas  for 
the  solution  of  discrete  search  problems  (Wegener  |9)),  we  may  believe  that  good  strategies 
have  the  (tt,  t2,  k)  -  property.  The  following  lemma  makes  these  ideas  precise. 

LEMMA  2:  For  each  strategy  s  and  for  each  (r t2,  k)  €  /  there  exists  a  strategy  s'  with 
the  (f|,  t2.  k)  -  property  which  is  at  least  as  good  as  s. 

PROOF:  If  s  has  infinite  expected  search  effort  the  assertion  is  obviously  correct.  Other¬ 
wise,  we  may  choose  7j  €  R0+  and  T2  6  R0+  so  that  s(7j,  k)  -  1,  and  s{T2,  k)  -  t2.  The  case 

where  we  have  to  choose  T2  -  <»  is  a  little  more  complicated.  We  give  our  proof  for  the  case 

r2  <  «  and  we  add  some  comments  as  to  how  we  change  the  proof  if  T2  -  <». 

Let  H  be  the  set  of  all  functions  h  :  (0,  T2  -  F|]  — *  [0.  r2  —  r,)  where  0^  h(x)~ 

h(y)  <  x  -  y  if  x  >  y,  A(0)  —  0  and  h(T2-  7j)  -  f2  -  t|. 

In  the  following  we  restrict  ourselves  to  strategies  ?  where  s(r.  •)  -  s(r,  •)  for  /  ^  T,  or 
1  >  T2.  For  such  a  strategy  J  and  x  €  10, T2-  F,l,  let  h(.v):-  s(T\  +  x,  k)  -  £(7j.  k). 
Obviously,  h  €  H. 
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On  the  other  hand,  we  may  define  for  h  6  H  a  corresponding  strategy  in  the  following 

way: 

sit.  •) sit.  •)  if  t  ^  Tx  ot  t  ^  T2. 
sit.  k)  :  —  fc)  +  /t(r  —  Tx)  if  Tx  <  t  ^  T2  and 
sit.  k ') siTit).  k')  if  Tx  ^  r  <  7"2  and  k‘  &  k 
where  f(r)  is  chosen  so  that  JT  s(r,  k")  =  t.  This  definition  means  that  we  treat  all  boxes, 

A  "i  B 

k'.  k  k ,  in  the  same  way.  If  for  s  there  is  a  point  of  time  T  where  siT.  k')  -  r'  and 
s(T.  fc  ")  -  /"then  there  is  a  point  of  time  T  where  1(7",  k')  =  /'  and  HT'.  k")  »  r". 

Let  //*  be  the  following  subset  of  W.  For  h  €  H*  there  exists 
.v  €  lO.Tj  -  T|  —  (»2  -  f|)]  so  that  hiy)**  0  and  h(y  +  t2- tt)  —  t2— tx.  Then  the 
corresponding  strategy  s  does  not  spend  any  effort  for  box  k  in  the  intervals  [T\,  y\  and 
ly  +  t2  -  tx.  Tj)  while  we  search  only  box  k  from  the  point  of  time  y  to  y  +  t2  -  tt.  This 
function,  h.  will  now  be  denoted  by  hv.  We  conclude  that  a  strategy  corresponding  to  a  func¬ 
tion  of  H  has  the  it\.  t2.  k)  -  property  iff  the  corresponding  function  h  is  an  element  of  H*. 

But  at  first  we  consider  another  subset  Hm  of  H  and  the  corresponding  set  of  strategies 
Sm.  h  €  Hm  iff  we  may  divide  the  interval  (0.  T2  -  7"|1  to  2m  parts  so  that  h  is  constant  on  m 
parts  of  length  iT2-  7j  -  it2  ~  tx))l m  each  while  on  the  other  m  parts  of  length  it2  -  tx)/m 
each  h  is  linearly  increasing  with  slope  1. 

If  T2  -  °°  the  first  m  parts  are  replaced  by  m 2  parts  of  length  it2  -  tx)/m  each,  and  one 
part  of  infinite  length,  which  of  course  is  always  the  last  of  all  m1  +  m  +  1  parts.  Since  obvi¬ 
ously  Hm  is  a  finite  set  there  exists  a  best  of  all  strategies  in  Sm. 

In  order  to  compute  a  best  strategy  in  Sm  we  have  to  decide  how  to  arrange  the  m  periods 
when  we  search  box  k  and  the  m  periods  (if  T2=  °°  m2  periods)  when  we  search  other  boxes. 
We  assume  that  we  are  not  able  to  find  the  hidden  object  during  a  period  of  search,  but  that  we 
find  the  object  at  the  end  of  a  period  with  the  same  probability  which  formerly  was  the  proba¬ 
bility  of  finding  the  object  during  the  whole  period.  By  this  assumption  the  expected  search 
effort  of  all  strategies,  s  6  Sm,  is  increased  by  the  same  amount.  Therefore,  a  best  of  all  stra¬ 
tegies  in  Sm  for  the  new  situation  is  also  the  best  strategy  in  Sm  for  the  original  situation. 

Now  we  are  in  a  situation  where  we  have  to  arrange  optimally  m  searches  of  box  k  and  m 
other  searches  (if  f2  =»  m2  other  searches)  and  where  we  may  find  the  object  only  at  the 
end  of  a  search.  This  is  the  situation  of  a  discrete  search  problem  which  has  been  solved  by 
Wegener  [9].  By  the  results  of  that  paper  the  searches  of  box  k  belong  together  since 
(/,.  t2,  k)  6  /.  Therefore,  s* ,  the  best  strategy  in  Sm ,  prescribes  the  m  searches  of  box  k  one 
after  another.  Hence,  the  corresponding  function  is  of  the  form  hv  for  an  appropriate  ym. 

Now  we  like  to  approximate  the  given  strategy  s  by  a  strategy  s»  (  S,.  By  elementary 
analysis  it  is  easy  to  define  sm  in  such  a  way  that  the  event  that  the  search  effort  for  box  k'  €  B 
reaches  t  6  Ro  if  we  use  sm  has  a  delay  of  at  most 

max((/2  -  tx)/m,  iT2  -  Tx  -  (/2  -  tx))/m]  <  iT2  -  Tx)/m 
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compared  with  the  same  event  for  the  given  strategy  s.  Therefore, 
£(s;)  <  £<$.>■<  £(s)  +  (T2  -  Tt)/m  and 
lim  £■  <s*  >  <  £(s). 

m 


If  T2—  °°  the  delay  is  at  most  (t2-  i\)lm  plus  that  search  effort  for  box  k  which  is 
prescribed  by  s  after  the  point  of  time  where  we  now  begin  with  the  period  of  infinite  length. 
This  point  of  time  is  7",  +  (/2  -  r , )  +  m(t2  -/,)—<»  for  m  —  Since  lim 

l—oo 

s(r.  k)  -  t2  <  °°  jhe  second  term  of  the  delay  tends  to  0,  too.  We  obtain,  in  the  same  way  as 
above,  the  result  lim  £(s*)  ^  £(s) 

m  — «> 

For  j*  we  have  defined  the  corresponding  function  //,  .  Since  ym  €  [0,  T2—  Tt  — 
ir2-  /|>),  there  is  a  subsequence  ym-  of  ym  which  converges  to  v'  f  (0,  T2  -  T\-  (t2-  r, )] 
If  T2  =  oo  we  may  easily  prove  that  the  corresponding  strategy  s0  of  is  better  than  the 
corresponding  strategy  of  /»,  if  >•  >  A  for  an  appropriate  A  <  °°  Since  s0  €  S„  for  all  m  we 
conclude  that  ym  €  (0.  A  I  and,  for  that  reason,  has  a  convergent  subsequence 

We  define  s'  as  the  strategy  corresponding  to  )»,•.  s'  has  the  ( / , .  t2.  k)  -  property  and 

£(s  )  Urn  £(s*  •)  <  him  £<s* )  ^  £<s).  0  E  D. 

m  — »  m  — 00 

REMARK.  There  is  at  most  one  triple  (t{.  l2.  k ')  so  that  s  has  the  (l{.  t2.  k')  -  property 
and  s'  has  not.  But  this  interval  has  been  broken  into  only  2  parts.  Again,  by  an  easy  applica¬ 
tion  of  the  results  19J  we  obtain  a  strategy  s"  at  least  as  good  as  s  which  has  the  < r, .  t2.  k)  - 
property  and  the  (r*.  /J.  k*)  ■  property  for  all  triples  for  which  s  has  this  property. 

We  note  that  we  have  not  yet  proved  that  we  may  restrict  ourselves  to  strategies  which 
have  the  (fp  i2,  k)  •  property  for  all  triples  of  /  If  we  apply  lemma  2  and  the  remark  for  all 
triples  of  /,  one  after  another,  we  have  to  apply  it  (in  certain  cases)  for  infinitely  many  times. 
It  is  not  obvious  that  the  resulting  sequence  of  strategies  converges  to  a  strategy  which  is  at 
least  as  good  as  the  given  strategy. 

5.  OPTIMAL  STRATEGIES  FOR  THE  GENERAL  CASE 

By  S'  we  denote  the  set  of  all  strategies  which  have  the  (/|.  i2.  k)  -  property  for  all  triples 

of  /. 


LEMMA  3:  The  optimal  strategy  s'  which  we  have  constructed  for  the  dual  problem  is 
the  best  strategy  of  S'  for  the  original  problem. 

PROOF:  Let  s  €  S*.  For  each  (r,.  t2,  k)  €  /  there  exist  T2  €  so  that 
T2  -  T\  -  t2~  i{,  s  searches  only  box  k  during  the  period  of  time  [ 7", ,  T21  and  the  search 
effort  for  k  increases  during  this  period  of  time  from  f|  to  i2.  For  i  C  IF,,  r2l 

@(s.  I )  -  Q(s.  i )  -  p(k)(.q(t\  +  t  -  T\.  k)  -  +  t  -  7", ,  A ) > 

is  independent  of  s  6  S*  and  for  all  other  r  €  R0+ 

Qis.  r)  -  Q(s,  t)-  0. 
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Thus,  £(s)  -  £(s )  is  independent  of  s  €  S*  if  £(s)  <  °°.  (£and  Q  indicate  that  we  use  s  for 
the  dual  problem.)  If  £(s)  —  °°  we  conclude  £(s)  —  »  since  £(s)  ^  £(s).  By  definition 
s*  €  S*  and  by  Theorem  3  s*  is  optimal  for  the  dual  problem.  Combining  our  results  we  have 
proved  the  assertion.  Q.E.D. 

LEMMA  4:  The  optimal  strategy  s *  which  we  have  constructed  for  the  dual  problem  has 
finite  expected  search  effort  for  the  original  problem. 

PROOF:  We  have  assumed  that  there  exists  a  strategy  s  with  finite  expected  search  effort. 

Thus, 

J(l-  (>(s*,  t))dt  -  £(s*)  £(s)  ^  £(s)  <  oo. 

10.  «>  > 

By  definition 

£(s*)  -  /  (1  -  Q(s\  t))dt  -  /  (1  ~Q(s\t))di 

|0.O°>  |0.°°> 

+  /  ((Ms*.  i )  -  Q(s.  t))dt 

(O.oo  ) 

where  the  first  term  is  finite. 

Since  s*  €  S*  we  may  divide  (0,°°)  into  Intervals  where  Q(s%.  t )  —  Q(s *.  t)  and  intervals 
(corresponding  to  the  triples  of  f)  where  Q(s*.  t)  is  Jinearly  increasing  and  @(s*.  t)  < 
Q(s*.  t ).  We  may  reflect  the  area  between  Q(s*.  t )  and  Q(s*.  t)  at  the  straight  line  Q(s*.  /). 
Since  the  reflected  areas  are  disjoint  and  lie  between  Q(s*.  t )  and  the  constant  function  1,  the 
second  term  above  is  not  larger  than  the  first  term.  Thus  £(s*)  <  00 .  Q.E.D. 

THEOREM  4:  The  optimal  strategy  s*  which  we  have  constructed  for  the  dual  problem  is 
optimal  for  the  original  problem,  too. 

PROOF:  If  the  statement  of  the  theorem  is  false  there  exists  a  strategy  s'  with  finite 
expected  search  effort  so  that  «  :-  £(s*)  -  £(s')  >  0.  Without  loss  of  generality  we  may 
assume  that  lim  s*(r.  k)  -  lim  s'(/,  k )  for  all  k.  We  prove  the  theorem  by  defining  a  strategy 

/— oo  f— 'OO 

s  €  S*  which  is  better  than  s*.  This  is  a  contradiction  to  Lemma  3.  Starting  with  s'  we  shall 
define  s  step  by  step. 

STEP  1 :  We  choose  T  €  R0+  so  that 

J  tdQis*.  •)  <  «/2 
<r°o| 

(this  is  possible  since  £(s*)  <  °°  by  lemma  4)  and  so  that  s*(T.k)  <  tx  or  s*(T,k)  >  r2  for 
each  triple  (t\,  t2.  k)  €  I  (this  is  possible  since  s*  €  S*).  Let  /'be  the  set  of  triples  where 
s*(T,  k)  >  t2. 

Let  s"(f.  A:)  :-s*(r,  Ac)  for  t  ^  Tand  for  t  <  T we  imitate  s’.  For  t  €  |0,  T]  we  define 
s  "(r,  k) :-  min|s*(T,  Ac),  s'W.  Ac)) 
where  we  choose  t’  in  such  a  way  that  £  s  "(/,  k)  -  t. 

kiB 
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E(s")  =  f  tdQis",  •)  +  J*  tdQ(s",'). 

lo.  ri  tr°°l 

The  event  that  the  search  effort  for  box  k  reaches  t  <  s*(T.  k)  if  we  use  s"  happens  not 
later  than  the  same  event  if  we  use  s'.  The  first  term  of  the  above  equation  for  £(s")  is  equal 
to  the  following  value.  If  we  find  the  object  in  box  k  with  search  effort  t  ^  s*(T,  k),  we  have 
to  pay  the  whole  search  effort  which  we  have  spent  using  s "  which  is  not  more  than  the 
corresponding  search  effort  if  we  use  s'.  If  we  find  the  object  after  we  have  spent  more  than 
s*(T,  k)  for  box  fc,  we  have  to  pay  nothing  which  is  also  not  more  than  the  corresponding 
search  effort  for  s'.  By  these  considerations  the  first  term  is  not  larger  than  Els').  The  second 
term  equals  ^  ^  tdQ(s*.  •)  and  is  smaller  thane/2.  Thus, 

£(s")  <  £ (s ')  +  e/2. 

STEP  2:  Since  £  -  ^  T  we  may  define  /  "£  /'  so  that  /"contains  only 

finitely  many  elements  and  so  that 

£  (/,-/,)<  e/2. 

We  apply  Lemma  2  and  the  Remark  to  s  "  for  each  triple  of  /  "  one  after  another.  The  resulting 
strategy  is  called  s  and 

£(I)  <  £(s")  <  £  (s')  +  e/2. 


STEP  3:  J  has  the  ltt.t2.k)  -  property  for  all  triples  of  /  except  perhaps  the  triples  of 
/*  /' -  /".  But  the  length  of  all  intervals  which  belong  to  triples  of  /*  is  altogether  very 

small. 


We  define  a  sequence  of  strategies  s0 . V  Let  “  t.  If  r*_|  is  already  defined  we 

define  sk  in  the  following  way.  For  each  triple  {fj.  t2.  k)£  I *  we  choose  the  least  point  of  time 
t'  where  s* _ i < / '.  k)  -  /,  and  a  point  of  time  t"  where  s* _ j ( r k)  *  r2-  Outside  the  intervals 
</'.  r")  we  do  not  change  s*_(.  Inside  these  intervals  we  change  s*_i  to  the  strategy  which 
corresponds  to  h0  (see  the  proof  of  Lemma  2).  That  means  we  search  at  first  only  box  k  for  a 
period  of  time  of  length  i2  -  t\ ,  and  for  t  €  lr'  +  t2  -  f(,  r  "1  let  s*(r.  k)  -  t2  and  sk(t.  k')  - 
sk-i(TU),  k')  for  A;'  ^  k  and  an  appropriate  7"(r)  so  that  £  skU.k")-t.  By  the  choice  of  t' 

we  don’t  destroy  any  (tt.  t2.  k)  -  property.  Let  s  :=  s„. 

By  our  procedure  s  is  obviously  an  element  of  S*.  The  event  that  the  search  effort  for 
box  k  reaches  t  if  we  use  s  happens  at  most  by 

£  (r2-r,)<«/2 

later  than  the  same  event  for  S  -  Jo-  Thus, 

£(s)  <  £(s)  +c/2  <  E(s')  +  e  £(s*) 

and  s  €  S*.  Because  of  this  contradiction  to  Lemma  3  we  have  proved  the  Theorem.  Q.E.D. 
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6.  CONCLUSION 

In  this  paper  we  have  solved  a  search  problem  where  the  search  space  is  discrete  but 
where  time  is  continuous,  i.e.,  at  each  point  of  time  we  may  find  the  object.  Here  we  like  to 
compare  our  results  with  results  for  some  similar  search  problems. 

Many  authors  have  investigated  a  discrete  search  problem  where  one  may  find  the  object 
only  at  the  end  of  some  definite  searches.  This  search  problem  has  been  solved  by  Wegener 
19).  These  results  were  useful  for  the  solution  of  our  search  problem  since  we  used  these 
results  especially  in  Sections  4  and  5.  In  this  discrete  search  problem  there  does  not  exist 
always  an  optimal  strategy  while  in  this  paper  we  could  prove  the  existence  of  an  optimal  stra¬ 
tegy  in  each  situation.  Finally,  we  like  to  mention  that  Kadane  and  Simon  [4]  obtained  for 
discrete  search  problems  similar  results  in  the  situation  where  only  finitely  many  searches  have 
a  positive  probability  of  success. 

Arkin  til  solved  a  search  problem  which  may  be  called  the  continuous  version  of  our 
search  problem.  The  hidden  object  is  a  point  of  R".  The  a-priori  probability  is  given  by  a  den¬ 
sity  function  p:  R"  —  R0+.  q(\  x)  is  for  each  point  .v  €  R"  the  probability  distribution  func¬ 
tion  for  the  detection  of  an  object  at  point  x.  (We  omit  the  measure  theoretical  assumptions.) 
A  strategy  s  has  to  prescribe  how  we  distribute  the  search  effort:  s  :  Ro  x  Rn  ~ '  R^  so  that 
s(\  x)  is  increasing  and 


for  all  r.  The  probability  of  detecting  the  object  until  the  point  of  time  t  amounts  to 
(Ms.  i)  -  J*  p(x)q(s(t,  x),  x )dx. 

R" 

The  aim  is  to  minimize  the  expected  search  effort  of  a  successful  search.  At  first  Arkin  solved 
the  dual  search  problem  ( p ,  q)  where  p  and  q(\  x)  is  the  smallest  concave  function 
nowhere  smaller  than  q(\  x).  The  methods  of  Arkin  and  our  methods  of  Sections  2  and  3 
resemble  each  other. 

But  if  we  drop  the  assumption  that  the  functions  <?(’•  k)  and  q(‘.x)  are  concave  the 
solution  of  the  continuous  search  problem  is  much  easier  than  the  solution  of  our  problem. 
Since  each  point  of  R"  has  the  measure  0,  s(\  x)  is  not  necessarily  continuous  while  for  our 
search  problem  s(\  k)  is  always  continuous.  This  is  the  reason  why  there  exists  an  optimal 
strategy  s  (for  the  continuous  dual  problem)  so  that  s(t,  x)  d  (fIt  r2)  for  each  triple 
(/,.  t2.  x)  €  /.  That  means  that  the  search  effort  jumps  over  the  crucial  intervals.  This  optimal 
strategy  has  the  same  expected  search  effort  whether  we  use  it  for  the  dual  or  the  original  prob¬ 
lem.  Since  q  ^  q  the  expected  search  efTort  of  each  strategy  is  for  the  original  problem  not 
smaller  than  for  the  dual  problem.  Thus,  this  optimal  strategy  (for  the  dual  problem)  is 
optimal  for  the  original  problem,  too.  This  result  is  similar  to  our  main  result.  Stone  ([7], 
Theorem  2.4.6)  gives  a  more  elegant  proof  for  the  results  of  Arkin.  Finally,  we  like  to  state  an 
important  difference  between  the  continuous  problem  and  our  problem.  We  call  a  strategy  s 
uniformly  optimal  if  for  each  /  >  0  the  allocation  a,  where 

a,(k) s(f.  k)  resp.  a,(x) s(r.  x) 

is  /-optimal.  While  for  the  continuous  problem  the  optimal  strategy  is  even  uniformly  optimal 
there  does  not  always  exist  a  uniformly  optimal  strategy  for  our  problem  if  the  functions 
q(\  k)  are  not  concave.  For  an  example  see  Stone  (7]  (Example  2.2.9). 
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Again  Stone  [7]  gives  an  example  (2.4.8)  that  even  for  the  continuous  search  problem 
there  may  not  exist  a  uniformly  optimal  strategy  if  the  functions  q(\  x)  do  not  have  to  be 
right-continuous. 

Altogether,  our  results  together  with  the  results  of  Arkin  [1]  and  Stone  [7],  guarantee  the 
existence  of  strategies  which  minimize  the  expected  search  effort  for  any  "reasonable"  search 
problem  involving  continuous  effort. 
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ABSTRACT 

A  cutting  plane  scheme  embedded  in  an  implicit  enumeration  framework  is 
proposed  for  ranking  the  extreme  points  of  linear  assignment  problems.  This 
method  is  capable  of  ranking  any  desired  number  of  extreme  points  at  each 
possible  objective  function  value.  The  technique  overcomes  storage  difficulties 
by  being  able  to  perform  the  ranking  at  any  particular  objective  function  value 
independently  of  other  objective  values.  Computational  experience  on  some 
test  problems  is  provided. 


1.  INTRODUCTION 

The  ranking  of  extreme  points  of  an  assignment  polytope  has  been  proposed  as  an  imple¬ 
mentation  tool  for  solving  various  types  of  programming  problems.  Cabot  and  Francis  13]  have 
developed  a  procedure  which  ranks  the  extreme  points  of  an  assignment  polytope  in  order  to 
generate  a  monotone  increasing  sequence  of  lower  bounds  for  solving  concave  minimization 
transportation  problems.  Fluharty  [6]  has  used  similar  ideas  for  solving  the  quadratic  assign¬ 
ment  problem.  As  reported  by  McKeown  (10],  this  scheme  works  well  principally  when  the 
linear  term  in  the  objective  function  numerically  dominates  the  nonlinear  term.  Murty,  Karel 
and  Little  (12]  have  recommended  the  solution  of  the  traveling  salesman  problem  through  the 
use  of  ranking  extreme  points  of  an  assignment  polytope.  McKeown  [8]  also  suggests  the  use 
of  such  a  scheme  for  solving  the  bottleneck  assignment  problem.  Sherali  [15]  has  proposed  the 
ranking  of  possible  objective  values  taken  on  by  a  linear  assignment  problem  in  the  process  of 
solving  the  quadratic  assignment  problem.  Furthermore,  in  some  applications,  a  linear  assign¬ 
ment  problem  may  need  to  be  solved  in  the  presence  of  complicating  side  constraints,  some  of 
which  may  possibly  be  qualitative  such  as  behavioral  constraints  [8].  Therefore,  the  ranking  of 
extreme  points  of  an  assignment  polytope  is  a  viable  approach. 
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As  a  result,  several  researchers  have  devised  schemes  for  ranking  the  extreme  points  of 
an  assignment  polytope.  Murty  (11]  has  proposed  a  method  for  the  general  linear  fixed  charge 
problem.  This  method  was  implemented,  in  vain,  by  Cabot  and  Francis  [3]  for  the  transporta¬ 
tion  and  the  assignment  polytopes.  Murty 's  procedure  is  essentially  based  on  the  result  that 

given  the  first,  second . /rth  ranked  extreme  points  of  a  linear  programming  problem,  the 

(k  +  l)st  ranked  extreme  point  is  geometrically  adjacent  to  at  least  one  of  these  k  extreme 
points.  In  the  absence  of  degeneracy,  the  simplex  method  may  be  used  to  maintain  a  list  of 
such  adjacent  extreme  points.  In  the  presence  of  degeneracy,  however,  one  would  also  need  to 
determine  all  basic  representations  of  a  degenerate  vertex  in  order  to  use  the  simplex  method. 
Rubin  [14]  has  shown  how  Chernikova’s  algorithm  (4,5]  may  be  modified  to  handle  the  degen¬ 
erate  case.  Further,  McKeown  and  Rubin  [9]  have  specialized  this  technique  for  ranking 
extreme  points  of  a  transportation  problem,  and  McKeown  [8]  has  exploited  the  special  struc¬ 
ture  of  the  assignment  polytope  to  further  specialize  this  scheme  for  he  linear  assignment 
problem. 

Basically,  the  principal  difficulty  with  these  schemes  arises  from  the  nature  of  the  assign¬ 
ment  polytope  itself.  Balinski  and  Russakoff  [2]  have  demonstrated  that  for  _n  assignment 

m-i  i  \ 

problem  of  size  m,  there  are  £=  um  (m  -  i  -  1)!  extreme  points  adjacent  to  any  given 

,-o  i '  I 

extreme  point.  For  m  =  8,  this  figure  is  16,064.  Moreover,  the  assignment  polytope  has  a 
diameter  of  two.  That  is,  given  any  two  extreme  points,  .v  and  r,  either  ,v  and  y  are  geometri¬ 
cally  adjacent  to  each  other,  or  there  exists  an  extreme  point  c  distinct  from  x  and  y  such  that  v 
and  r  are  geometrically  adjacent  and  so  are  y  and  The  implication  of  this  is  that  the  pro¬ 
cedures  of  Murty  [11],  Rubin  [14]  and  McKeown  and  Rubin  [9]  which  require  an  explicit  list¬ 
ing  of  adjacent  extreme  points  become  intractable  for  the  linear  assignment  problem  Even  the 
procedure  of  McKeown  [8],  which  uses  cost  considerations  in  order  to  eliminate  the  storage  of 
a  subset  of  edges  adjacent  at  an  extreme  point,  is  plagued  with  this  problem.  This  viewpoint  is 
further  supported  by  the  storage  overflow  problems  reported  by  Fluharty  [6]  and  McKeown 
[10], 


This  paper  proposes  a  procedure  for  ranking  the  extreme  points  of  an  assignment  polytope 
which  completely  avoids  the  search  over  adjacent  extreme  points  of  the  ranked  solutions.  The 
technique  employs  trivially  generated  cutting  planes  in  an  implicit  enumeration  framework,  and 
is  capable  of  ranking  any  number  of  extreme  points  at  any  feasible  value  for  the  assignment 
problem.  Thus,  in  particular,  it  can  rank  all  the  extreme  points  of  the  assignment  polytope,  or 
it  can  rank  the  objective  function  values  that  can  be  taken  on  by  the  linear  assignment  problem 
by  simply  ranking  only  one  extreme  point  at  each  objective  value,  or  it  can  rank  the  objective 
function  values  that  can  be  taken  on  by  the  linear  assignment  problem  by  simply  ranking  only 
one  extreme  point  at  each  objective  value,  or  it  can  rank  a  preset  number  of  extreme  points  at 
each  attainable  value  within  any  possible  range  of  objective  function  values.  Moreover,  the 
ranking  of  extreme  points  at  any  given  objective  value  is  independent  of  the  ranking  at  'Mher 
values,  and  hence  storage  problems  are  greatly  reduced. 

In  the  next  section,  we  discuss  the  proposed  scheme  and  then  illustrate  it  through  a 
numerical  example.  Finally,  we  present  computational  results  using  several  test  problems 
including  some  available  in  the  literature. 
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2.  A  TWO-PHASE  PROCEDURE  FOR  RANKING  THE  EXTREME  POINTS 
OF  AN  ASSIGNMENT  PROBLEM 


Consider  the  linear  assignment  problem 

(m  m 

X  X  cijxij  -  x  € 

/-i  j- 1 

where 

m 

(2.1)  XA  =  {jc  —  (xu . xmm)'  £  *//  =  1  for  each  J  =  •  •  •  » 

<-i 

m 

£  Jty  =  1  for  each  /  —  1 . /n, 

j- 1 

Xy  =  0  or  1  for  each  7=1,  . . .  ,  m}. 

Let  c'=  (cu . cmm),  where  a  superscript  /defines  the  transpose  operation,  and  let 

(2.2)  i/min  =  minimum  {c'x:  x  6  XA],  »/max  =  maximum  {c'x:  x  €  XA). 

Consider  a  ?  €  [i>min,  and  assume  that  an  x1  €  XA  is  known  such  at  c'x1  =  ?. 

In  order  to  demonstrate  the  capabilities  of  our  procedure  as  discussed  in  the  foregoing 
section,  we  will  show  how  one  may  rank  up  to  a  maximum  of  nmax,  say,  extreme  points  having 
an  objective  value  of  ?,  and  then  how  one  may  generate  another  extreme  point  of  the  assign¬ 
ment  polytope  having  the  smallest  objective  value  greater  than  ?,  in  case  v  <  i<max.  We  note 
that  a  typical  linear  assignment  problem  may  have  a  large  number  of  extreme  points  with  the 
same  objective  value  ?  in  the  range  lvmm.  vmax\.  The  following  scheme  which  iterates  between 
two  phases  is  designed  to  exploit  this  property. 


To  begin  with,  in  the  first  phase,  denoted  by  Phase  I,  all  pairwise  exchanges  of  facilities 
relative  to  the  assignment  x1  which  yield  the  same  objective  value  are  attempted.  Thus,  if 
Xy-  ~  xki  =  1  a°d  if  Cjj  +  ckl  =  c,i  +  ckj  then  an  alternative  solution  with  assignments  given  by 
x‘  except  that  the  locations  of  facilities  i  and  k  are  interchanged,  is  generated  and  stored  in  a 
list  L(v).  After  all  extreme  points  with  objective  value  V  obtainable  through  such  pairwise 
exchanges  on  x1  have  been  generated,  the  next  solution  x2,  say,  is  selected  from  the  list  Z.(D, 
if  such  a  solution  exists.  Again,  alternative  solutions  having  an  objective  value  v  which  are 
obtainable  through  pairwise  exchanges  on  x2  are  generated  and  are  stored  in  L  (?)  provided 
they  have  not  been  generated  through  pairwise  interchanges  on  previously  examined  solutions 
(x1  in  this  case).  This  process  is  continued  till  none  of  the  solutions  stored  in  L(v)  lead  to  any 
new  points  of  XA  having  a  value  of  ?.  At  this  point,  the  procedure  transfers  to  the  second 
phase,  Phase  II.  Of  course,  if  at  any  point  in  this  generation  scheme,  the  number  of  points  in 
L  (?)  equals  nmax,  then  we  may  terminate  further  ranking  of  solutions  having  a  value  ?. 

In  Phase  II,  an  attempt  is  made  to  determine  if  there  exists  any  point  in  XA  -  L  (?)  which 
has  an  objective  value  of?.  Toward  this  end,  consider  the  solution  x1  €  XA  and  define  the  set 

(2.3)  /,-{(/.  J):xj-  I). 

Now,  it  is  easy  to  see  that  the  cutting  plane  £  x,j  <  m  -  2  deletes  the  point  x\  but  no 

other  point  in  XA.  This  is  so,  because  every  other  point  in  XA  must  have  at  least  two  of  the 
variables  x0 ,  (/,  j)  €  /,,  equai  to  zero.  But,  if  exactly  two  variables  x0.  (/,  j)  €  /|  are  zero  in 
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a  given  solution,  then  this  solution  is  necessarily  obtained  from  x1  through  a  pairwise  exchange. 
Since  we  are  no  longer  interested  in  solutions  obtained  through  pairwise  exchanges  on  x1  in  our 
search  for  points  in  XA  of  value  ?,  we  may  impose  the  cut 

(2.4)  £  x„  ^  m  -  3 

<  1,/if /, 

For  subsequent  solutions  x2.  x3,  ...  in  the  list  L(?),  a  cut  of  the  type  (2.4)  needs  to  be  gen¬ 
erated  only  if  pairwise  exchanges  on  these  solutions  lead  to  other  solutions  in  the  list  L(?). 
Hence,  if  the  current  list  contains  n  points,  then  we  would  have  some  it  ^  n  cuts  of  the  type 

(2.4)  generated  from  solutions  xp  for  p  €  J,  where  J  is  an  appropriate  index  set  with  |/|  =  n. 
These  cuts  would  be  of  the  form 


(2.5) 


where. 


£  x„  <  m  -  3 

u.juir 


/,=  !</.  j):  xf,  =  1} 


for  each  p  (l  J. 


Note  that  the  cuts  (2.5)  need  not  be  explicitly  generated  and  stored;  it  is  sufficient  to 
merely  maintain  the  index  set  J.  The  reason  for  this  is  the  following.  Consider  an  enumerated 

extreme  point  xp  with  xpf>u)  =  1  for  /  =  1 . m,  where  p(i)  denotes  the  location  of  facility  i 

in  the  solution  xp.  Then  one  may  store  xp  in  the  list  L  (?)  as  simply  the  permutation  vector 

m 

p(l) . p(m).  Now,  if  p  €  7,  then  the  cut  (2.5)  is  clearly  £  x,ptl)  <  m  -  3.  Hence,  in 

i-i 

order  to  check  if  a  given  extreme  point  x9  with  •*&<<>  **  1  for  /  “  1 . m  violates  or  satisfies 

(2.5) ,  one  simply  needs  to  check  whether  or  not  the  cardinality  of  the  set  { i € { 1 . mV 

p{i )  =  <?(/)}  is  greater  than  m  -  3. 

Now,  in  addition  to  the  cuts  (2.5),  since  we  are  no  longer  interested  in  points  of  objective 
value  less  than  ?,  we  may  impose  the  cost  cut 

(2.6)  £  £  c.jX'j  Ss  v. 
i- 1  J- 1 

Let  us  next  define  the  set 

(2.7)  Q  =  |x:  £  x0  ^  m  -  3  for  each  p  €  J,  c'x  ^ 

1  | 

and  consider  the  following  problem 

FEAS  (F):  minimize  !c'x:  x  €  Q  ft  XA}. 


Clearly,  Problem  FEAS  (F)  will  either  generate  as  an  optimal  solution  a  point  in 
XA  -  L  (F)  having  a  value  v ,  ot  failing  which,  will  indicate  that  no  such  point  exists.  Thus,  let 
x  solve  FEAS  (?).  If  c'x  -  F,  then  we  may  set  x"+l  -  xand  transfer  to  Phase  I,  attempting  all 
pairwise  exchanges  starting  with  the  solution  xn+\  On  the  other  hand,  if  c'x  >  F  or  if  FEAS 
(?)  is  infeasible,  then  the  list  L(v)  is  complete,  and  we  will  say  that  we  have  completed  one 
iteration.  Now  observe  that  during  the  course  of  an  iteration  one  may  have  to  resort  to  several 
solutions  of  Problem  FEAS  (?)  with  different  sets  Q.  As  a  computational  expedient  for  solving 
these  problems  in  a  particular  iteration,  we  will  describe  an  implicit  enumeration  scheme  which 
is  initialized  only  once  during  the  first  visit  to  Phase  II  of  the  procedure,  and  is  simply  updated 
at  each  subsequent  visit  in  that  iteration. 
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The  fundamental  implicit  enumeration  scheme  we  use  is  due  to  GeolTrion's  [7]  adaptation 
of  Balas'  [1]  method.  Basically,  we  chose  this  depth-first  enumeration  scheme  because  of  the 
ease  and  the  minimal  storage  requirements  with  which  one  may  perform  all  the  bookkeeping 
operations  necessary  in  the  implicit  enumeration  procedure.  The  bookkeeping  here  involves 
simply  the  maintenance  of  a  partial  solution  list  which  indicates  in  chronological  order  which 
variables  are  currently  fixed  in  value  at  either  zero  or  one.  Thus,  a  partial  solution  list 
( -v 1 1  =  1,  v2?  =  0,  ,vi4  =  0,  ah  =  I)  would  indicate  that  the  variables  .vn,  v2,.  ,vu  and  ah  are 
currently  fixed  at  the  indicated  values,  and  moreover,  that  ,vn  was  fixed  at  one  before  v2,  was 
fixed  at  zero,  which  in  turn  occurred  before  it  was  decided  to  set  ,vi4  =  0,  and  so  on.  For  the 
assignment  problem,  this  feature  is  very  convenient  for  one  may  enforce  the  assignment  con¬ 
straints  by  merely  ensuring  that  no  facility  is  ever  assigned  to  more  than  one  location  and  vice 
versa  in  the  partial  solution  list.  In  other  words,  .v„  =  I  implies  xkl  =  .v(/  =  0  for  each  k, 

I  =  I . m,  k  ^  /,  /  ^  i.  Of  course,  if  a  facility  is  barred  from  each  location,  or  vice  versa. 

then  the  partial  solution  will  be  fathomed.  Thus,  for  example,  if  our  current  partial  solution  list 
is  (.Yu  =  1,  .vii  =  0,  .v,4  =  0,  .vjy  =  1),  we  would  never  consider  augmenting  this  list  by  setting 
.yji  =  1  or  .Yu  =  1,  since  .vn  =  1  implies  that  .v2)  =  0  and  that  ah  =  0.  Furthermore,  if  this 
partial  solution  list  is  always  incremented  each  time  by  assigning  a  facility  to  a  location,  then 
whenever  a  partial  solution  is  fathomed,  one  simply  locates  the  rightmost  variable  in  this  partial 
solution  list  which  is  fixed  at  value  one,  then  one  complements  the  value  of  this  variable  to 
zero,  and  deletes  all  fixed  variables  to  the  right  of  this  variable.  Hence,  a  new  partial  solution 
list  is  obtained.  Consequently,  consider  the  partial  solution  list  (.vI2  =  1,  ah  =  0,  ,yu  =  0, 
.v25  =  1,  An  =  0).  Note  that  since  the  partial  solution  list  is  always  augmented  by  suitably  set¬ 
ting  some  variable  equal  to  one,  each  of  the  variables  which  are  currently  zero  were  at  some 
previous  point  in  time  equal  to  one.  In  particular,  .y,|  was  equal  to  one  before  being  set  equal 
to  its  current  value  of  zero.  Observe  that  the  current  partial  solution  list  bars  facility  3  from  all 
locations  1,2 . 5  (assuming  m  =  5).  Thus,  this  solution  is  said  to  be  fathomed.  Conse¬ 

quently,  we  locate  the  rightmost  variable  in  the  list  which  is  currently  one.  namely  ah.  comple¬ 
ment  it  to  zero,  i.e.,  set  x2s  =  0,  and  delete  all  variables  to  its  right,  namely,  eliminate  .v,;  =  0 
from  the  list  in  this  case.  This  gives  us  the  new  partial  solution  list  (.vi2  -  1,  ah  =  0,  x,4  =  0, 
.v25  =  0).  As  proven  by  GeofTrion  [7],  this  procedure  would  result  in  a  nonredundant,  exhaus¬ 
tive,  implicit  enumeration  of  all  solutions. 

The  algorithmic  statement  below  gives  the  details  for  the  incrementing  and  the  fathoming 
operations.  Before  discussing  these  implementation  details,  we  draw  the  reader's  attention  to 
two  specific  points.  Firstly,  the  moment  a  feasible  solution  to  FEAS  (F)  of  value  F  is  detected, 
then  this  solution  necessarily  solves  FEAS  (F).  Secondly,  during  the  implicit  enumeration 
scheme,  we  maintain  an  incumbent  solution  x  of  value  F  whLh  is  updated  each  lime  an 
improved  solution  is  found,  except  if  this  mproved  solution  has  a  value  F.  This  ensures  that 
when  we  have  finally  enumerated  all  solutions  having  a  value  of  F,  and  have  consequently 
solved  the  final  problem  FEAS  (F),  the  incumbent  solution  value  is  equal  to  the  next  value 
larger  than  F,  taken  on  by  a  feasible  assignment,  if  such  a  value  exists.  A  detailed  description 
of  both  Phase  I  and  Phase  II  of  the  algorithm  is  given  below.  Here,  the  algorithm  finds  at  most 
"max  extreme  points  having  a  specified  objective  value  equal  to  F.  It  is  assumed  that  the  cost 
coefficients  c,/s  are  all  integer  valued. 

PHASE  I.  (Generate  an  initial  list  L( F)  using  pairwise  exchanges.) 

INITIALIZATION:  (This  step  is  performed  each  lime  a  new  iteration  is  being  com¬ 
menced;  otherwise,  the  procedure  transfers  to  Step  1  at  each  subsequent  visit  to  Phase  I.) 
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Suppose  that  \A  €  ,V4  is  given  with  c'.v1  =  v.  Let  v1  be  the  first  solution  in  a  list  Hi). 
Set  F  «  r  =  »  =  1.7  =  <6.  Proceed  to  Step  I. 

STEP  I:  Attempt  all  pairwise  exchanges  on  .v'  to  find  solutions  of  value  F  which  have  not 
as  yet  been  listed  in  HT’).  Store  these  solutions  by  adding  them  to  L  ( F ).  Further  if  either  this 
is  the  first  time  Step  1  is  being  executed  at  the  current  visit  to  Phase  1  or  if  pairwise  exchanges 
on  .v'  lead  to  new  alternative  solutions  of  value  F.  replace  7  by  7  U  lr).  In  any  case,  increment 
«.  if  necessary,  so  that  n  equals  the  total  number  of  extreme  points  in  the  list  HT).  If 

n  >  nmdX,  the  maximum  number  of  points  desired  to  be  ranked  at  the  value  T.  then  transfer  to 

Step  10  of  Phase  II,  Otherwise,  continue. 

STEP  2:  If  r  is  equal  to  w,  then  proceed  to  Phase  II.  Otherwise,  increment  r  by  one  and 
return  to  Step  1. 

PHASE  II.  (Solution  of  the  current  Problem  FEAS  (F)  .) 

INITIALIZATION:  (This  step  is  executed  only  on  the  first  visit  of  the  procedure  to 
Phase  II  during  a  particular  iteration.  At  all  subsequent  visits,  the  procedure  may  transfer 
directly  to  Step  3  since  the  previous  partial  solution  list  at  the  last  visit  to  Phase  II  is  also  valid 
at  this  stage.  The  reason  being  that  the  new  problem  FEAS  (F)  is  a  further  restriction  of  the 
previous  FEAS  (F),  and  the  manner  in  which  Steps  3  through  9  below  are  executed  ensures 
that  solutions  previously  fathomed  are  also  currently  inconsequential.) 

Initialize  the  partial  solution  vector  by  assigning  m  -  2  facilities  as  in  the  solution  v1.  Set 

F  =  °°.  Since  neither  of  the  two  completions  of  this  partial  solution  lead  to  a  new  solution  of 

value  F,  proceed  to  Step  6. 

STEP  3:  Find  the  feasible  completion  x  €  X4  of  the  partial  solution  veci  i  ♦  'itch  na.«v  .!• 
izes  the  value  of  c'x  through  the  solution  of  the  associated  linear  assignment  *>'7oiem.  Thai  is. 
solve  the  linear  assignment  problem  of  minimizing  c'x  subject  to  x  6  X4  given  that  the  assign 
ments  indicated  by  the  partial  solution  list  are  forced  a  priori.  (Note  that  by  virtue  of  Step  1 
below,  this  step  is  never  entered  with  fewer  than  three  unassigned  facilities.)  If  no  completion 
feasible  to  the  assignment  constraints  exists,  or  if  the  resulting  solution  x  satisfies  c'x  >  F, 
transfer  to  Step  6.  Otherwise,  continue. 

STEP  4:  Determine  if  x  €  Q,  where  Q  is  defined  by  Equation  (2.7).  Transfer  to  Step  7  if 
it  is  not  and  continue  otherwise. 

STEP  5:  If  c'x  —  F,  then  set  xr+l  =  x,  replace  r  by  r  +  1 ,  place  .v  in  L  (F)  and  go  to  Step 
1  of  Phase  I.  The  current  problem  FEAS  (F)  is  solved,  with  the  optimal  solution  value  being  F. 
Otherwise,  since  c'x  <  F  from  Step  3,  set  x  =  x  and  F  =  c'x  and  continue. 

STEP  6:  Fathom  the  current  partial  solution  by  finding  the  rightmost  variable  in  the  par¬ 
tial  solution  list  which  is  fixed  at  one,  complementing  it  to  zero,  and  deleting  (setting  free)  all 
fixed  variables  to  the  right  of  this  variable.  If  no  such  variable  exists,  go  to  Step  10  since  then 
the  current  problem  FEAS  (F)  is  solved  with  the  optimal  objective  function  value  being  F  >  F. 
Otherwise,  return  to  Step  3. 
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STEP  7:  If  more  than  three  facilities  have  not  as  yet  been  located  somewhere  in  the 
current  partial  solution,  proceed  to  Step  8.  Otherwise,  enumerate  the  feasible  completions  in 
XA  of  the  partial  solution.  Note  that  at  most  six  such  completions  exist.  If  either  there  are  no 
feasible  completions  or  if  all  the  feasible  completions  are  infeasible  to  Q,  then  transfer  to  Step 
6. 


If  any  feasible  completion  lies  in  Q  and  has  an  objective  value  i\  then  set  x'+l  equal  to 
this  solution,  place  x'"1  in  the  Z.(F),  replace  rby  r  +  1  and  transfer  to  Phase  I. 

If  any  feasible  completion  lies  in  Q  and  has  a  value  less  than  F,  then  pick  that  solution 
with  the  least  objective  value  and  update  x  and  F  by  replacing  them  with  this  solution  and  its 
objective  value  respectively  and  transfer  to  Step  6.  If  none  of  the  above  cases  hold,  simply 
transfer  to  Step  6. 

STEP  8:  Find  a  feasible  completion  x  €  XA  which  maximizes  the  value  of  c'x.  If 
c'x  <  F,  then  go  to  Step  6  since  then  clearly  there  exists  no  completion  of  the  current  partial 
solution  feasible  to  FEAS  (F).  Otherwise,  continue. 

STEP  9:  Let  x  €  XA  be  the  solution  found  in  Step  3.  Increment  the  current  partial  solu¬ 
tion  by  assigning  that  unassigned  facility  /  to  that  free  location  j  which  satisfies  x,(  -  1  and 
which  yields  the  smallest  value  of  ey  from  among  all  such  pairs  (/,  j).  If  there  are  more  than 
three  unassigned  facilities,  go  to  Step  3.  Otherwise,  go  to  Step  7. 

STEP  10:  The  list  Hr)  is  complete.  If  F  =  i'max,  the  procedure  terminates.  If  it  is  now 
required  to  generate  a  new  point  x*  6  XA  with  c'x*  =  r*  such  that  v’  is  the  next  greater  value 
taken  on  by  the  linear  assignment  objective  function  after  F.  perform  the  following  routine. 

First  check  if  F  -  F  +  1.  If  it  is,  then  necessarily,  x*  =  x,  r  *  —  F.  If  not,  then  using  the 
current  xas  an  incumbent  solution  of  value  0  and  setting  Q  -  (x:  c'x  >  r  +  1),  use  the  above 
implicit  enumeration  scheme  (Phase  II)  with  obvious  modifications  to  solve  the  problem  of 
minimizing  c'x  subject  to  x  C  XA  Cl  Q.  Then  x*  and  r*  are  the  optimal  solution  and  optimal 
objective  value  obtained,  respectively.  Again,  note  that  if  during  this  search,  a  solution  of 
value  F  +  1  is  detected,  then  this  is  an  optimal  solution.  Now,  replacing  F  by  t>*and  x1  by  x*, 
one  may  initiate  a  new  list  L  (F)  with  x1  as  the  first  solution  in  this  list  and  transfer  to  Phase  I. 

Observe  again  that  during  the  performance  of  Phase  II  if  any  solutions  are  fathomed,  then 
we  will  never  have  the  occasion  to  reexamine  these  solutions  during  subsequent  visits  to  Phase 
II  for  a  particular  iteration.  Hence,  the  implicit  enumeration  scheme  of  Phase  II  may  simply  be 
updated  at  each  visit.  For  this  reason,  the  incumbent  solution  x  is  not  updated  when  Phase  II 
detects  an  optimal  solution  to  FEAS  (F)  of  value  F. 

This  further  enables  one  to  possibly  avoid  extra  search  in  determining  the  solution  x*  in 
Step  10.  In  our  experience,  the  condition  F  -  F  +  1  often  held  at  Step  10.  Finally,  we  explain 
the  motivation  behind  Step  7.  Observe  that  the  cuts  of  type  (2.5)  are  such  that  if  at  least  three 
facilities  are  unassigned  in  a  partial  solution  then  for  each  of  these  cuts,  there  exists  a  comple¬ 
tion  (not  necessarily  feasible,  though)  which  will  satisfy  that  cut.  It  is  because  of  this  reason 
that  we  do  not  attempt  to  check  for  existence  of  feasible  completions  which  satisfy  these  cuts, 
until  three  facilities  remain  unassigned.  However,  we  do  check  at  Step  8  for  existence  of  feasi¬ 
ble  completions  which  satisfy  the  cut  c'x  ^  F. 
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3.  ILLUSTRATIVE  EXAMPLE 

Consider  a  linear  assignment  problem  with  m  =  4,  having  the  following  cost  matrix 


|l  2  2  l) 

For  this  problem,  =  4,  rmax  =  12.  Let  us  select  v  =  vmin  -  4  and  enumerate  all  possible 
solutions  with  this  objective  value.  Hence,  we  know  x1  =  (3, 1,2,4)  of  value  c'x1  -  v  =  4. 
(Note  that  an  assignment  solution  is  being  represented  as  a  permutation  vector 
(p(I).  P( 2) . p(m )),  where  pU)  satisfies  xip(n  —  1,  /  —  1 . m  for  that  solution.) 

ITERATION  1. 

PHASE  I 

INITIALIZATION:  Given  x1  =  (3, 1,2,4)  with  c'x'  —  v  —  4,  we  have,  L(i>)  — 
((3, 1,2,4)).  Set  v  **  oo,  r  =  n  =  1 ,  J  =  <i>. 

STEP  1:  No  further  solutions  are  generated.  Set  J  -  |l),  n  -  |y|  -  1  and  from  Equation 
(2.7),  Q  =  {x:  Jr,3  +  jr2|  +  xr32  +  x4A  <  1,  c'x  ^4). 

STEP  2:  Since  r  =  n  =  1,  proceed  to  Phase  II. 

PHASE  II. 

INITIALIZATION:  Here,  m  -  2-2.  Thus,  our  starting  partial  solution  list,  abbreviated 
by  PS  henceforth,  is  PS  -  (xu  -  1,  x21  -  1). 

STEP  6:  Fathom  PS.  This  gives  the  updated  PS  -  (x|3  —  1,  x2!  —  0). 

STEP  3:  Solve  the  assignment  problem  with  xi3  -  1,  x21  -  0  to  get  x  -  (3,4,2, 1)  with 
c'x  —  7  <  v  —  oo. 

STEP  4:  x  e  Q. 

STEP  7:  Currently,  PS  -  (x(3  =  1,  x2!  »  0).  Thus,  the  completions  are 
(3, 2,1,4)  9  Q 

(3. 2.4.1)  €  Q  of  value  8 

(3.4. 1.2)  €  Q  of  value  10 
(3,4, 2,1)  9  Q. 

Hence,  update  x  -  (3,2,4, 1)  with  i>  -  8. 

STEP  6:  Fathom  PS.  The  new  PS  —  (x(3  —  0). 

STEP  3:  Solve  the  assignment  problem  with  x)3  -  0  to  get  x- (2, 1,3,4)  with 
c'x  -  6  <  v  -  8. 
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STEP  4:  x  e  Q 

STEP  7:  Currently,  PS  -  (jcj3  —  0).  Thus,  ail  four  facilities  are  unassigned. 

STEP  8:  Putting  x13  -  0,  find  the  maximizing  assignment  solution  x.  Thus, 
c'x  —  12  <  F  “  4. 

STEP  9:  Examine  x  -  (2, 1,3,4)  found  in  Step  3  above.  Force  the  assignment  x2]  -  1. 
Hence,  the  new  PS  -  (x13  -  0,  x21  -  1). 

STEP  7:  Enumerate  the  completions  of  PS 

(2,1,3,4)  *  Q 

(2. 1.4.3)  €  Q  of  value  9 

(4. 1.2. 3)  €  Q 

(4, 1,3, 2)  €  Q  value  8 

STEP  6:  Fathom  PS.  The  new  PS  -  (x13  -  0,  x21  -  0). 

STEP  3:  Solve  the  assignment  problem  with  X|3  -  0,  x2|  -  0  to  get  x  -  (1,2,3, 4)  with 
c'x  -  8  >  0  -  8. 

STEP  6:  Since  PS  —  (x13  —  0,  x21  —  0)  currently,  fathoming  PS  leads  to  a  termination  of 
the  present  iteration. 

STEP  10:  L( 4)  —  {(3, 1,2, 4)},  a  singleton.  Now,  to  find  x *  €  XA  of  Step  10,  since 
v  -  8  >  v  +  1  -  5,  we  let  Q  -  (x:  c'x  >  5}  and  using  x  -  (3,2,4, 1),  we  construct  a  starting 
partial  solution  list  PS  -  0r13  -  I,  x22  —  1).  Verifying  that  the  best  completion  of  this  list  is 
indeed  xof  value  v  -  8,  we  proceed  to  Step  6. 

STEP  6:  Fathom  PS.  The  new  PS  -  (x)3  —  1 ,  x22  -  0) . 

STEP  3:  With  X|3  -  1,  x22  -  0,  the  linear  assignment  problem  yields  x  -  (3,4,2, 1)  with 
c'x  —  7  <  v  —  8. 

STEP  4:  x  €  Q  of  Step  10  above. 

STEP  5:  Update  x  s  x  -  (3,4,2. 1),  v  -  c'x  -  7. 

STEP  6:  Fathom  PS.  The  new  PS  -  (xl3  -  0). 

STEP  3:  With  x13  -  0,  solve  the  linear  assignment  problem  to  get  x  -  (2, 1,3,4)  with 
c'x  -  6  <  v  -  7. 

STEP  4:  x  €  Q  of  Step  10  above. 

STEP  5:  Update  x  s  x  -  (2, 1,3,4),  t  -  c'x  -  6. 

STEP  6:  Fathom  PS.  ’i.  Nis  to  a  termination  of  the  operation  of  finding  the  next 
larger  objective  value.  Hence,  v*of  Step  10  are  x*  =  x  -  (2, 1,3,4)  and  v*  =  v  «-  6. 

One  may  now  enumerate  solutions  of  value  6  if  so  desired. 
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4.  COMPUTATIONAL  EXPERIENCE 

For  the  purpose  of  computational  testing,  we  will  use  the  following  test  problems.  The 
problem  of  size  m  =  4  is  the  illustrative  example  of  Section  3.  The  problem  of  size  m  -  3  has 
the  following  cost  matrix 

3  2  1 
1  2  1 
2  1  2 

Problems  of  sizes  m  =  5,  6,  7,  8,  12  and  15  have  as  their  cost  matrices  those  corresponding  to 
the  linear  assignment  problems,  the  extreme  points  of  which  need  to  be  ranked  by  Cabot  and 
Francis’  [3]  procedure  applied  to  the  quadratic  assignment  test  problems  of  Nugent,  Vollmann 
and  Ruml  [131  of  respective  sizes  m  =  5,  6.  7,  8,  12  and  15.  Problems  of  sizes  25,  50,  75,  and 
100  have  their  cost  data  generated  as  indicated  below. 

The  procedure  was  coded  in  FORTRAN  IV  and  the  computational  times  reported  are  in 
c.p.u.  seconds  of  execution  time  on  a  CDC  Cyber  70  Model  74-28/CDC  6400  computer  in 
time-sharing  mode  under  normal  batch  operation  with  compilation  performed  using  the  OPT  = 
1  option.  Table  4. 1  gives  the  experience  for  a  total  ranking  of  all  extreme  points  of  the  assign¬ 
ment  polytope,  that  is,  with  wmax  =  m!.  Table  4.2  gives  the  experience  for  simply  ranking  the 
objective  function  values  taken  on  by  the  linear  assignment  problem,  that  is,  with  wmax  =  1. 
Table  4.3  gives  results  for  ranking  at  most  500  extreme  points  at  each  objective  value  assumed 
by  the  linear  assignment  problem,  so  that  nmax  =  500.  Table  4.4  deals  with  ranking  at  most  50 
extreme  points  at  the  first  10  objective  function  values  attainable.  The  data  for  these  problems 
is  generated  according  to  c0  =  (y)mod  10.  Finally,  Table  4.5  gives  some  computational  testing 
of  problems  with  cost  coefficients  uniformly  generated  at  random  in  the  interval  [0,  100], 

Note  that  during  any  iteration,  each  visit  to  Phase  II  results  in  only  one  additional 
extreme  point  being  enumerated  except  for  the  last  visit,  when  no  extreme  point  is 
enumerated.  In  the  tables  below,  the  column  designated  "a"  records  the  average  number  of 
passes  through  Phases  I  and  II  per  iteration.  Hence,  the  total  number  of  extreme  points  gen¬ 
erated  in  Phase  II  is  given  by  (a-I)  (Number  of  iterations). 


TABLE  4.1  —  Ranking  of  All  Extreme  Points  (nmax  *=  w!) 


m 

^min 

^max 

Number  of 
Iterations 

a 

b 

c.p.u.  Seconds  of 
Execution  Time 

3 

3 

7 

3 

1.00 

■TflVTBMfnaUH 

0.018 

4 

■ 

12 

8 

2.25 

1(4),  1(6),  3(7),  7(8),  3(9), 
4(10),  4(11),  1(12) 

0.477 

5 

50 

55 

6 

2.833 

20(50),  14(51),  28(52), 

28(53),  16(54),  14(55) 

1.378 

6 

82 

84 

2 

1.00 

480(82),  240(84) 

66.677 

1 

137 

144 

8 

1.125 

48(137),  384(138),  912(139), 
1344(140),  1200(141), 
768(142),  288(143),  96(144), 

981.556 

a  —  Average  number  of  passes  through  Phases  1  and  II  per  iteration 
b  —  Number  of  extreme  points  ranked  with  values  v  in  parentheses  at  each  iteration 
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TABLE  4.2  —  Ranking  of  AH  Objective  Function  Values  («max  =  1) 


m 

^min 

4'max 

Number  of 
Iterations 

a 

Execution  Time  in  c.p.u  Seconds 

min  time  per 
iteration 

max  time  per 
iteration 

Total 

time 

3 

3 

7 

3 

1.00 

0.002 

0.004 

0.008 

4 

4 

12 

8 

1.00 

-0.000 

0.058 

0.092 

5 

50 

55 

6 

1.00 

-0.000 

0.026 

0.042 

6 

82 

84 

2 

1.00 

0.002 

0.101 

0.103 

7 

137 

144 

8 

1.00 

-0.000 

0.196 

0.443 

8 

186 

199 

14 

1.00 

-0.000 

1.723 

5.670 

12 

493 

517 

25 

1.00 

-0.000 

1.622 

5.700 

15 

963 

1034 

72 

1.00 

-0.000 

4.320 

26.631 

a  —  Same  connotation  as  for  Table  4.1. 


TABLE  4.3  —  Partial  Ranking  of  Extreme  Points  at  each 
Objective  Value  with  nmM  —  500 


m 

^min 

*'max 

Number  of 
Iterations 

, 

c.p.u.  Seconds  of 
Execution  Time 

7 

137 

144 

8 

1.125 

166.142 

8 

186 

199 

14 

1.00 

250.906 

12 

493 

517 

25 

1.00 

201.315 

15 

963 

1034 

72 

1.00 

624.209 

a  —  Same  connotation  as  for  Table  4.1 


TABLE  4.4  —  Partial  Ranking  of  Extreme  Points  at  the 
First  10  Objective  Values  with  rrmax  =  50 


m 

•'min 

^max 

a 

Execution  Time  in  c.p.u  Seconds 

min  time  per 
iteration 

max  time  per 
iteration 

Total 

time 

25 

21 

1 83 

1.00 

0.034 

0.288 

0.900 

50 

40 

365 

1.00 

0.026 

0.102 

0.522 

75 

61 

548 

1.00 

0.032 

0.354 

1.046 

100 

80 

730 

1.00 

0.042 

0.154 

0.828 

a  —  Same  connotation  as  for  Table  4.1. 
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TABLE  4.5  —  Partial  Ranking  of  Extreme  Points  of  Problems  with  Cost  Coefficients 
Uniformly  Generated  at  Random  in  the  Interval  [0,  100] 


Number  of 

Execution  Time  in  c.p.u  Seconds 

Iterations 
and  nmaI 

m 

v  mtn 

^  max 

a 

min  time  per 
iteration 

max  time  per 
iteration 

Total 

time 

Number  of 

25 

99 

2356 

1.00 

-0.00 

22.46 

46.47 

Iterations  =  10 

50 

107 

4805 

1.00 

-0.00 

15.91 

33.29 

75 

108 

7297 

1.00 

-0.00 

15.25 

33.15 

^max  1 

100 

135 

9794 

1.00 

-0.00 

51.27 

105.19 

Number  of 

25 

99 

2356 

3.00 

2.63 

23.59 

66.20 

Iterations  =  5 

50 

107 

4805 

4.20 

24.55 

98.21 

273.23 

75 

108 

7297 

4,00 

81.68 

184.47 

585.87 

"max  =  5 

100 

135 

9794 

3.00 

60.38 

171.23 

609.47 

Number  of 
Iterations  =  10 

"max  ~  ml 

25 

99 

2356 

6.00 

2.25 

56.06 

282.07 

Number  of 
Iterations  =*=  5 

"max  “  ml 

25 

99 

2356 

3.20 

2.22 

25.95 

70.03 

a  —  Same  connotaiion  as  for  Table  4  1 
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ABSTRACT 

The  integer  programming  literature  contains  many  algorithms  for  solving 
all-integer  programming  problems  but,  in  general,  existing  algorithms  are  less 
than  satisfactory  even  in  solving  problems  of  modest  size.  In  this  paper  we 
present  a  new  technique  for  solving  the  all-integer,  integer  programming  prob¬ 
lem.  This  algorithm  is  a  hybrid  lie.,  primal-dual)  cutting-plane  method  which 
alternates  between  a  primal-feasible  stage  related  to  Young's  simplified  primal 
algorithm,  and  a  dual-infeasible  stage  related  to  Gomory’s  dual  all-integer  algo¬ 
rithm.  We  present  the  results  of  computational  testing. 


In  this  paper,  we  describe  an  algorithm  which  we  call  the  Constructive  Primal-Dual  Algo¬ 
rithm  (CPDA),  for  solving  IP  problems  of  the  form  (P)  below. 


n 


(1) 

maximize 

*0  ~  £  CJXJ  =  0 
] 

(P) 

(2) 

subject  to 

I  a,Jx1  b„  /  -  1 . m 

y-i 

(3) 

Xj  ^  0  and  integer,  j  —  1 . n 

(4) 

where  cJt  aijt  b,  are  integer  for  all  i,  j. 

Glover  [2]  developed  a  "Pseudo  Primal-Dual"  integer  programming  algorithm  (PPDA)  in 
which  the  pure  integer  programming  problem  is  solved  in  two  stages,  systematically  violating 
and  restoring  dual  feasibility  while  maintaining  an  all-integer  tableau  (PPDA  is  the  only 
primal-dual  algorithm  reported  in  the  literature).  This  algorithm  starts  with  a  dual  feasible 
tableau  similar  to  that  of  Gomory  [3j.  A  cut  constraint  is  generated  from  the  row  with  the  larg¬ 
est  number  of  negative  elements,  and  dual  simplex  is  then  applied.  If  this  stage  does  not  des¬ 
troy  dual  feasibility,  then  the  process  is  repeated  until  an  optimal  solution  is  reached.  Other¬ 
wise,  dual  feasibility  is  restored  by  a  sequence  of  "pseudo-primal"  pivot  steps  using  the  column 
that  is  lexicographically  most  negative  when  divided  by  the  corresponding  coefficient  in  the 
source  row  (restricting  attention  to  positive  coefficients),  and  the  pivot  row  from  the  dual  stage. 

The  CPDA  alternates  between  a  primal-feasible  stage  related  to  Young's  SPA  (71,  and  a 
primal-and  dual-infeasible  stage  related  to  Gomory’s  dual  all-integer  algorithm  (3).  The  CPDA 
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departs  from  Young’s  technique  in  its  choice  of  the  cut  row.  This  algorithm  solves  (P)  by  usu¬ 
ally  starting  primal-feasible  and  dual-infeasible  (if  (P)  is  a  maximization  problem  with  all 
constraints),  as  does  Young’s  SPA.  Whenever  a  stationary  cycle  is  encountered,  the  CPDA 
avoids  degenerate  iterations  by  developing  a  cut  which  deliberately  moves  into  the  infeasible 
region,  and  then  attempts  to  return  to  primal  feasibility  at  a  better  solution  point  than  the  one 
from  which  it  departed. 

1.  THE  CONSTRUCTIVE  PRIMAL-DUAL  ALGORITHM 

This  section  presents  an  outline  and  a  discussion  of  the  CPDA.  Two  similar  versions  of 
the  algorithm  are  presented  (CPDA-1  and  CPDA-2),  and  are  used  in  solving  two  distinct 
classes  of  integer  programming  problems.  We  employ  the  Beale  tableau,  explicitly  carrying  only 
the  nonbasic  columns  and  the  constraint  vector. 

CPDA-1  can  be  used  in  solving  most  standard  IP  problems;  i.e.,  those  IP  problems  that 
are  not  highly  primal-  or  dual-degenerate.  Algorithmic  steps  of  the  CPDA-1  are  as  follows. 

Primal  Stage 

STEP  1:  Write  the  IP  in  form  (P).  If  some  b,  <  0  then  go  to  the  dual  stage.  Otherwise, 

check  to  determine  whether  c,  >  0  for  j  —  1,2 . /r,  if  so,  stop;  the  current  basis  is 

optimal;  otherwise,  go  to  Step  2. 

STEP  2:  Select  the  column  ak  which  is  lexicographically  the  most  negative  as  the  pivot 
column.  If  all  components  of  ak  are  negative,  stop,  the  value  of  the  objective  function  is 
unbounded.  Otherwise,  select  as  the  source  row  that  fir  for  which  b,/ a,k  is  a  minimum,  and 
alk  >  0.  Break  ties  by  arbitrary  selection.  If  b,  >  ark  go  to  Step  4;  otherwise,  go  to  Step  3. 

STEP  3:  In  column  ak  search  for  a  new  row  /3,  such  that  [br/ark]  is  the  smallest  ratio 
greater  than  or  equal  to  1,  where  ark  >  0,  and  where  {  ]  signifies  the  greatest  integer  part.  If 
no  such  row  fir  exists  then  row  /3,  from  Step  2  is  selected  as  the  source  row.  Go  to  Step  4. 

STEP  4.  Construct  the  cut-constraint  as  follows: 

(5)  £  \arJark)  (/,)  +  sc  -  ll >r/ark). 

j- 1 

where  sc  is  a  nonnegative  integer  variable  called  the  "cut-slack.'’  Append  constraint  (5)  to  the 
bottom  of  the  current  tableau.  Go  to  Step  5. 

STEP  5:  Perform  a  simplex  iteration  by  pivoting  on  the "  + 1"  (the  pivot  element)  in  the 
bottom  row  and  in  the  selected  pivot  column  ak.  The  slack  variable  sc  in  (5)  becomes  a  non¬ 
basic  variable.  After  the  iteration  is  completed,  discard  the  bottom  row.  If  some  bt  <  0  in  the 
current  tableau,  then  go  to  the  dual  stage.  If  all  cy  >  0,  stop;  the  current  basis  is  optimal.  Oth¬ 
erwise,  go  to  Step  2. 

Dual  Stage 

STEP  6:  If  all  b,  ^  0  and  c;  >  0,  stop;  an  optimal  solution  has  been  obtained.  Other¬ 
wise,  go  to  Step  7. 
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STEP  7:  Select  the  source-row  /3,  as  the  row  for  which  r  >  1,  and  r  -  min  {/ :  b,  <  0). 
Go  to  Step  8. 

STEP  8:  If  a..  >  0  for  all  j  >  2,  stop;  (P)  has  no  feasible  solution.  Otherwise,  consider 

1  i 

all  columns  aj  such  that  arJ  <  0.  Choose  ak  such  that  ck  <  0  and  ak  <  a,  for  all 
7  =  1,2 . n,  where  "<"  represents  the  lexicographically  smallest  column,  and 


If  there  is  no  such  ck  <  0  then  go  to  Step  9;  otherwise  go  to  Step  10. 

STEP  9:  For  the  source-row  /3r  from  Step  7  and  for  all  arj  <  0,  select  the  entering 

column  ak  such  that  ak  >  ctj  if  and  only  if  ck  >  0.  If  there  is  no  Cj  >  0,  then  consider  a} 
such  that  Cj  =  0,  for  arJ  >0.  Go  to  Step  10. 

STEP  10:  Construct  a  dual  cut  with  cut-constant  | a , J ,  generating  a  cut  with  a  pivot  ele¬ 
ment  of  "-1."  Append  the  constraint  (5)  to  the  bottom  of  the  tableau.  Go  to  Step  11. 


STEP  11:  Perform  a  simplex  iteration  by  pivoting  on  the  1"  in  the  bottom  row  and  in 
the  selected  pivot  column  a*.  The  slack  variable  sc  in  (5)  becomes  a  nonbasic  variable.  After 
the  iteration  is  completed,  discard  the  bottom  row.  If  some  b,  <  0,  then  go  to  Step  6,  other¬ 
wise  go  to  the  primal  stage  (Step  2). 


We  modified  CPDA-1  to  solve  a  special  class  of  IP  problems  which  are  highly  primal-  and 
dual-degenerate  (e  g.,  fixed-charge  IP  problems).  A  modified  version  of  CPDA-1  (designated 
herein  as  CPDA-2)  involves  the  following  changes  to  the  original  algorithm. 


In  Step  2  of  CPDA-1,  if  b,  >  ark  for  some  source-row  /3,  then  go  to  Step  4.  Otherwise, 
we  substitute  a  procedure  for  Step  3  which  constructs  a  cut  from  the  objective  function  of  the 
current  tableau,  using  the  largest  (least  negative)  negative  c,  as  the  "cut  constant."  Inequality 
(6)  is  used  to  construct  the  cut-constraint  (7);  that  is: 


(6) 

(7) 


X  CjXj  >  *0  +  1, 

/-I 

]£  [c,/cj  (lj)  +  sc  =  [(x0  +  l)/cj, 
im  i 


where  x0  is  the  current  value  of  the  objective  function.  Since  the  resulting  tableau  is  primal- 
infeasible,  we  proceed  to  Step  6. 


Note  that  all  the  other  steps  (with  the  exception  of  Step  3)  remain  the  same  in  this  ver¬ 
sion  of  CPDA. 


Discussion  of  CPDA 

At  this  point,  it  is  useful  to  discuss  in  more  detail  some  of  the  algorithmic  steps  presented 
in  the  previous  section. 
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In  Step  2  of  CPDA-1,  if  brl  ark  >  1  then  the  next  iteration  will  be  a  transition  cycle;  oth¬ 
erwise  a  normal  primal  iteration  would  generate  a  stationary  iteration  (since  ark  >  br).  By 
imposing  the  "next  higher  integer  rule,"  the  CDPA  is  actually  attempting  to  avoid  a  stationary 
cycle  whenever  possible  by  transforming  the  primal-feasible  problem  into  a  primal-  and  dual- 
infeasible  one.  The  "next  higher  integer"  rule  is  actually  the  essence  of  the  basic  CDPA  algo¬ 
rithm.  It  is  at  this  stage  of  the  algorithm  that  CPDA  departs  from  Young's  SPA. 

In  Step  8,  the  effect  is  to  generate  the  strongest  possible  cut  in  the  primal-dual-infeasible 
tableau.  Experience  with  many  test  problems  has  shown  that  primal  feasibility  may  be  reached 
much  faster  if  the  ck/ark  ratio  is  the  smallest  possible  ratio  for  ail  ck  and  ark  >  0.  In  most  non¬ 
degenerate  problems  (in  a  dual  sense),  this  rule  is  very  effective.  However,  for  highly  dual¬ 
degenerate  problems,  CPDA-1  is  usually  less  efficient.  In  Step  9,  the  CPDA  is  attempting  to 
avoid  dual-degenerate  iterations  whenever  possible.  That  is,  when  zero  prices  are  present, 
CPDA  refuses  to  use  those  columns  as  pivot  elements;  instead  it  searches  for  a  c,  >  0  combi¬ 
nation. 

The  major  difference  between  the  CPDA-1  and  CPDA-2  is  in  the  use  of  the  "Objective 
cut"  rule.  That  is,  instead  of  selecting  the  next  higher  integer  ratio  of  b,!a,k  (where  b,  >  ark), 
whenever  the  next  iteration  would  normally  be  a  stationary  cycle,  a  cut  constructed  from  the 
current  objective  function  is  appended  to  the  bottom  of  the  tableau,  and  the  RHS  value 
replaced  by  a  "—1".  The  effect  is  that  CPDA-2  is  constraining  the  current  value  of  the  objective 
function  to  be  at  least  one  unit  higher,  and  challenging  the  primal-feasibility  state  of  the  prob¬ 
lem.  Two  conditions  may  result;  (a)  the  tableau  will  remain  primal-feasible,  and  generate 
further  transition  cycles,  increasing  the  objective  function  value;  or  (b)  the  algorithm  will  enter 
a  primal-dual-infeasible  stage  and  attempt  to  regain  primal-feasibility  status  by  generating  dual 
cuts.  At  this  point  the  tableau  may  indicate  that  there  is  no  feasible  solution  to  the  problem,  by 
recognizing  that  in  at  least  one  of  the  constraints,  b,  <  0  and  all  a„  >  0.  In  that  case,  the 
computation  is  terminated  with  the  most  recent  primal-feasible  tableau  being  the  optimal  solu¬ 
tion  of  the  IP  problem.  However,  if  this  condition  does  not  occur,  the  CPDA  continues  with 
the  primal-dual-infeasible  stage  of  the  problem  until  a  primal-feasible  tableau  is  reached.  This 
process  is  repeated  until  an  optimal  solution  is  obtained. 

2.  RESULTS  OF  COMPUTATIONAL  TESTING 

A  set  of  nineteen  test  problems  reported  by  Haldi  [41,  and  solutions  given  by  Trauth  and 
Woolsey  [5]  and  Wahi  and  Bradley  [6],  are  used  for  computational  testing.  These  problems  are 
categorized  by  Trauth  and  Woolsey  as  follows. 

(a)  ten  fixed-charge  problems  of  Haldi;  and 

(b)  nine  allocation  problems  of  Trauth  and  Woolsey. 


Trauth  and  Woolsey  report  that  each  problem  set  has  been  selected  to  show  a  different 
facet  of  the  generalized  integer  linear  programming  problem.  The  problems  are.  as  a  rule, 
rather  small;  however,  their  difficulty  is  illustrated  by  the  fact  that  three  of  the  smallest  did  not 
converge  with  four  of  five  commercial  codes  after  15,000  iter->tions. 

The  nine  allocation  problems  are  used  to  investigate  the  sensitivity  of  some  integer  linear 
problems  to  a  relatively  minor  change  in  the  problem  matrix.  Each  allocation  problem  consists 
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of  ten  decision  variables,  one  major  constraint,  and  the  upper  bound  constraints  for  each  of  the 
decision  variables.  All  nine  problems  are  identical  in  their  structure  except  for  the  right-hand- 
side  values.  The  ten  decision  variables  are  of  the  0-1  type. 


The  ten  fixed-charge  problems  were  chosen  because  of  the  difficulty  of  solution  in  spite  of 
their  small  size.  These  problems  are  difficult  chiefly  because  their  initial  tableaus  are  heavily 
primal-  and  dual-degenerate. 


Analysis  of  Computational  Results 

Results  of  computational  testing  for  ten  "fixed-charge"  problems  are  exhibited  in  Table  1. 
The  "fixed-charge"  problems  are  interesting  and  unusually  difficult  to  solve,  because  their  start¬ 
ing  tableaus  are  heavily  dual-  and  primal-degenerate.  The  performance  of  CPDA-2  is  compared 
to  that  of  seven  other  algorithms.  Incidently,  the  authors'  version  of  Young's  SPA  did  not 
converge  for  any  of  the  twenty-five  test  problems;  it  is  therefore  excluded  from  further  analysis 
in  the  following  "computational  results"  tables. 

TABLE  1  —  Results  of  Computational  Testing  for  Fixed-Charge  Problems 


Commercial  Codes' 

BDA 

IPM3 

LIP1 

IL2-1 

1LP2-2 

1PSC 

12 

54 

24 

135 

36 

32 

14 

81 

15 

94 

47 

45 

10 

37 

26 

154 

104 

56 

12 

91 

18 

93 

18 

22 

107 

+  7000 

158 

+  7000 

+  7000 

6104 

83 

+  7000 

123 

+  7000 

311 

3320 

107 

+  7000 

159 

+  7000 

+  7000 

+  7000 

83 

+  7000 

126 

+  7000 

306 

+  7000 

44 

118 

42* 

+  7000 

289 

339 

879 

1396 

102* 

+  7000 

+  7000 

+  7000 

On-Site  Tests 


Code 

Problems  # 


CPDA-2 

Optimality  Feasibility 


1 

2 

3 

4 

5 

6 

7 

8 
9 

10 


3 

3 

3 

6 

7 

7 

7 

7 

64 

4* 


7’ 

7* 

7* 

8* 

8* 

8* 

8* 

8* 

65 

NA 


Source:  (5) 

'Best  number  of  iterations 

“Best  useful  solution  (within  20%  of  optimum)  — no  convergence  in  10  minutes  of  CPU  time 


For  every  fixed-charge  problem  under  CPDA-2  in  Table  1,  two  different  results  are 
reported:  (a)  the  number  of  iterations  after  CPDA-2  has  attained  optimality,  and  (b)  the 
number  of  iterations  after  CPDA-2  has  reached  an  infeasible  state.  For  example,  in  fixed- 
charge  problem  #1  CPDA-2  reached  the  optimum  solution  in  only  three  iterations.  Then,  in 
order  to  determine  whether  the  current  solution  is  feasible  as  well,  it  proceeded  with  the  algo¬ 
rithmic  process  until,  in  the  seventh  iteration,  it  reached  an  infeasible  state.  Therefore,  in  this 
example,  only  three  iterations  were  required  by  CPDA  to  solve  problem  #1,  but  seven  were 
required  to  verify  that  fact. 

For  problems  #1  through  #8,  CPDA-2  converged  in  less  than  eight  iterations,  which  is  a 
remarkably  consistent  and  encouraging  performance.  It  is  worth  noting  that  CPDA-2  reached 
the  "near-optimum"  solution  for  these  problems  almost  instantly.  For  fixed-charge  problem  #9, 
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CPDA-2  was  only  outperformed  by  Austin’s  Bounded  Descent  Algorithm  [1]  and  the  commer¬ 
cial  code  LIP  1 ,  although  the  difference  in  number  of  iterations  does  not  appear  to  be  large.  For 
the  fixed-charge  problems,  from  an  overall  point  of  view,  CPDA-2  reduced  the  number  of 
iterations  significantly.  Next  to  CPDA-2,  Austin’s  BDA  showed  an  interesting  superiority  over 
all  the  other  IP  codes  examined. 

The  results  of  computational  testing  for  the  nine  allocation  problems  are  exhibited  in 
Table  2.  Since  the  allocation  problems  are  "normal"  integer  programming  problems  (i.e.,  they 
are  not  heavily  primal-  and  dual-degenerate),  the  usefulness  of  CPDA-2  is  limited.  Therefore, 
the  basic  form  of  CPDA,  i.e.,  CPDA-1  was  used  in  solving  these  problems.  CPDA-1  was 
slightly  outperformed  by  BDA  and  one  of  the  commercial  IP  codes  (IPM3).  However,  in  gen¬ 
eral,  the  number  of  iterations  appears  to  have  the  same  order  of  magnitude  for  these  algo¬ 
rithms.  One  reason  for  this  poorer  performance  by  CPDA-1  is  that  is  it  a  primal-dual  algo¬ 
rithm.  That  is,  problems  are  solved  in  two  distinct  stages  in  which  the  algorithm  searches  for 
both  optimality  and  feasibility  conditions,  thus  requiring  a  relatively  larger  number  of  iterations 
in  reaching  the  final  solution.  However,  this  difficulty  did  not  deter  the  CPDA  from  outper¬ 
forming  several  of  the  "one  stage"  IP  codes  tested.  For  instance,  in  almosi  all  the  cases,  the 
performance  of  CPDA-1  can  be  ranked  no  worse  than  three  out  of  a  possible  seven  (in  fact  for 
problems  #7  and  #8  CPDA  gave  the  best  result). 


TABLE  2  —  Results  of  Computational  Testing  for  Allocation  Problems 


On-Site  Tests 

Commercial  Codes' 

Code 

Problems  # 

CPDA-1 

BDA 

IPM3 

LIP1 

IL2-1 

1LP2-2 

IPSC 

1 

19 

54 

51 

46 

2 

55 

163 

77 

64 

3 

30 

41 

168 

59 

71 

4 

30 

12* 

18 

19 

192 

48 

62 

5 

30 

3* 

11 

12 

139 

32 

50 

6 

45 

35 

18* 

40 

157 

54 

81 

7 

38* 

395 

61 

81 

504 

119 

131 

8 

21* 

38 

21* 

51 

370 

57 

102 

9 

66 

2* 

12 

12 

20! 

34 

44 

'Source.  15) 

’Best  number  of  iterations 


3.  CONVERGENCE  PROPERTIES 

As  regards  t'  e  existence  of  a  mathematical  proof  of  convergence  for  the  CPDA,  two  dis¬ 
tinct  stages  must  be  considered  — primal-feasible  convergence  and  primal-dual-infeasible  conver¬ 
gence.  For  the  first  case,  two  conditions  may  occur:  (a)  the  next  iteration  in  the  solution  pro¬ 
cess  is  a  transition  cycle,  in  which  case  the  convergence  proof  will  be  similar  to  that  for 
Young’s  SPA  [7];  or  (b)  the  next  iteration  in  the  solution  process  is  a  stationary  cycle,  in  which 
case  the  CPDA  will  require  the  use  of  the  "next  higher  integer  rule,"  which  would  in  fact  neces¬ 
sitate  a  convergence  proof  of  the  CPDA  for  the  first  stage.  No  formal  mathematical  proof  has 
been  obtained  for  this  condition.  Fortunately,  the  second  stage  of  CPDA  (i.e.,  the  primal- 
dual-infeasible  case)  is  very  similar  to  that  of  Glover's  PPDA  [2]  for  which  a  convergence 
proof  is  available  in  the  literature. 
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On  the  surface,  there  would  seem  to  be  the  possibility  that  the  CPDA  might  leave  the 
feasible  region  and  reenter  at  the  same  point  from  which  it  departed— thus  cycling  and  failing  to 
converge.  However,  during  dual  iterations,  the  feasible  region  is  diminished  by  the  dual  cuts, 
so  that  the  CPDA  would  encounter  a  different  problem  upon  reentry  than  it  departed  from  — 
even  though  it  re-entered  at  the  same  lattice  point.  This  fact  is  undoubtedly  the  key  to  a  for¬ 
mal  convergence  proof  for  the  CPDA. 


4.  SUMMARY  AND  CONCLUSIONS 

The  CPDA  is  appealing  for  several  reasons:  (a)  in  primal  algorithms,  artificial  variables 
must  be  used  when  the  constraints  (2)  of  (P)  include"^"  or"="  constraints,  and  the  presence 
of  such  variables  usually  causes  a  significant  increase  in  computational  effort,  (b)  for  highly 
degenerate  problems,  existing  primal  algorithms  frequently  encounter  cycles  which  involve 
thousands  of  stationary  iterations;  (c)  in  all  current  dual  algorithms,  the  objective  function  (1) 
of  (PI  must  include  only  c,  ^  0,  or  must  be  transformed  into  this  form  by  finding  upper 
bounds  on  the  associated  variables;  (d)  Glover's  PPDA  [6]  (the  only  other  primal-dual  algo¬ 
rithm  known  to  exist)  can  only  solve  minimization  problems  with  some”>"  and/or  "**"  con¬ 
straints,  and  finally  (e)  the  main  drawback  of  all  dual-based  algorithms  including  Glover's 
PPDA  is  their  inability  to  produce  a  "useful”  solution  before  the  integer  optimal  solution  is 
reached.  CPDA  has  this  capability. 

As  regards  computational  efficiency,  CPDA  performed  well  in  solving  highly  primal-  and 
dual-degenerate  fixed-charge  problems.  By  incorporating  an  objective  function  cut,  CPDA 
manages  to  delete  many  primal-dual-degenerate  iterations,  and  eventually  reaches  the  optimal 
solution  in  a  small  number  of  iterations.  On  the  allocation  problems,  CPDA  performed  con¬ 
sistently,  but  did  not  dominate  the  other  algorithms— as  was  the  ca^e  for  the  fixed-charge  prob¬ 
lems. 


The  general  conclusion  drawn  from  this  research  regarding  the  computational  efficiency  of 
the  CPDA  (in  comparison  with  other  IP  algorithms)  may  be  summarized  as  follows. 

(a)  Computation  time  is  code  dependent; 

(b)  Although  an  integer  problem  may  be  easily  solved  by  one  IP  code,  another  code  may 
prove  a  complete  failure; 

(c)  CPDA  can  obtain  a  "useful"  solution  if  the  computation  must  be  stopped  prematurely. 
This  is  an  important  characteristic,  since  dual-based  algorithms  (and  Glover's  hybrid  PPDA)  do 
not  have  such  a  feature  No  one  code  is  uniformly  better  for  all  types  of  integer  problems, 
although  CPDA  appears  to  perform  extremely  well  on  a  set  of  difficult  fixed-charge  problems. 


Two  Caveats 

First,  we  should  note  here  that  the  standard  test  problems  used  in  our  computational 
testing— although  difficult— are  rather  small.  We  are  currently  in  the  process  of  randomly  gen¬ 
erating  larger  problems  for  testing  the  algorithm,  but  preliminary  results  indicate  that  such 
problems— although  larger— are  relatively  easy. 
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Second,  we  should  acknowledge  chat  branch-and-bound  techniques  for  IP  are  generally 
considered  superior  to  cutting-plane  approaches.  The  problem  with  comparing  these  two  tech¬ 
niques  is  that  the  criterion  of  interest  in  our  study  was  the  number  of  simplex  iterations,  rather 
than  CPU  time.  Moreover,  for  ease  of  programming,  the  algorithm  is  written  in  SAS,  which 
has  remarkable  matrix/vector  handling  capabilities,  but  is  relatively  inefficient  in  the  sense  of 
CPU  time.  We  intend  to  further  refine  the  algorithm  (e  g.,  to  incorporate  advanced  starts)  and 
to  reprogram  it  in  PL-1.  Once  this  is  accomplished,  we  will  he  able  to  make  comparisons 
regarding  solution  efficiency,  by  comparing  our  results  with  those  obtained  by  IBM’s  MPSX, 
which  is  a  branch-and-bound  algorithm. 
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ABSTRACT 

This  paper  addresses  the  problem  of  computing  the  expected  discounted  re¬ 
turn  in  finite  Markov  and  semi-Markov  chains  The  objective  is  to  reveal  in¬ 
sights  into  two  questions.  First,  which  iterative  methods  hold  the  most  prom¬ 
ise?  Second,  when  are  iterative  methods  preferred  to  Gaussian  elimination? 
A  set  of  twenty-seven  randomly  generated  problems  is  used  to  compare  the 
performance  of  the  methods  considered.  The  observations  that  apply  to  the 
problems  generated  here  are  as  follows  Gauss-Seidel  is  not  preferred  to  Pre- 
Jacobi  in  general.  However,  if  the  matrix  is  reordered  in  a  certain  way  and  the 
author's  row  sum  extrapolation  is  used,  then  Gauss-Seidel  is  preferred 
Transforming  a  semi-Markov  problem  into  a  Markov  one  using  a  transforma¬ 
tion  that  comes  from  Schweitzer  does  not  yield  improved  performance  A 
method  analogous  to  symmetric  successive  overrelaxation  (SSOR)  in  numerical 
analysis  yields  improved  performance,  especially  when  the  row-sum  extrapola¬ 
tion  is  used  only  sparingly.  This  method  is  then  compared  to  Gaussian  elimina¬ 
tion  and  is  found  to  be  superior  for  most  of  the  problems  generated 


1.  INTRODUCTION 

This  paper  addresses  the  problem  of  computing  the  unique  A-vector  v*  that  satisfies 
v*  =  r  +  Pv*,  where  r  is  a  given  A- vector  and  P  is  nonnegative  and  has  a  spectral  radius  less 
than  one.  The  primary  intended  application  of  this  problem  is  finding  the  expected  discounted 
return  in  infinite  horizon,  stationary,  finite  state,  discrete  time  parameter,  Markov  and  semi- 
Markov  chains,  although  there  are  other  interesting  applications.  For  instance,  this  problem 
arises  in  solving  Leontief  input-output  problems  (Gale  [8])  and  when  determining  the  value 
associated  with  a  given  basis  when  optimizing  over  Leontief  substitution  systems  (see  Veinott 
(341  and  Koehler,  Whinston,  and  Wright  (141).  It  also  arises  when  using  finite  element  and 
finite  difference  approximations  to  partial  differential  equations  (see  Varga  (33),  Young  (371, 
Reid  (261,  and  Fox  (7)).  In  the  latter  case,  P  is  usually  symmetric  and  much  of  the  modern 
advanced  numerical  analysis  that  deals  with  this  problem  exploits  that  fact,  and  is  therefore  not 
applicable  to  the  Markov  or  semi-Markov  chain  case,  since  P  is  rarely  symmetric  then.  Per¬ 
tinent  discussion  of  this  problem  in  the  context  of  Markov  and  semi-Markov  chains  can  be 
found  in  references  (21,  (4-61,  (12-141,  (16],  119-25],  (30-321,  (35),  and  136], 

The  objective  of  this  paper  is  to  shed  light  on  two  questions.  First,  which  iterative 
methods,  among  a  number  of  candidates,  offer  the  most  promise?  Second,  when  are  iterative 
methods  preferred  to  Gaussian  elimination?  A  set  of  twenty-seven  Markov  chain  problems  is 
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generated  and  used  to  compare  the  various  methods  identified.  There  are  nine  problems  with 
50,  100,  and  200  rows  each.  The  rules  used  for  generating  the  problems  are  described  briefly  in 
the  appendix.  The  next  section  introduces  some  notation  and  sections  3-6  deal  with  the  first 
question,  regarding  the  promise  of  certain  iterative  methods.  Section  7  deals  with  the  second 
question  and  section  8  summarizes  the  conclusions  of  the  paper. 

2.  NOTATION 

We  use  the  notation  and  conventions  of  Porteus  122],  so  only  a  partial  exposition  of  them 
is  given  here.  The  /.«,  norm  is  used  exclusively.  The  iterative  methods  we  consider  can  be  put 
into  the  form  v"—  r  +  P\ '’~l  when  no  extrapolations  are  used,  where  r  and  P  may  well  be 
induced  from  an  implicit  transformation.  The  tildes  are  suppressed  whenever  clarity  ! 
preserved.  We  assume  that  P  >  0  andp(P)  <  1.  Let  a  denote  the  vector  of  row  sums  of  P , 
a  the  minimum  row  sum,  and  a  the  maximum  row  sum.  In  the  Markov  chain  case,  a  —  a, 
and,  in  the  semi-Markov  chain  case,  a  <  a.  The  sequence  {v")  converges  geometrically  at  the 
rate  p  if  0  <  p  <  1  and  there  exists  a  real  number  M  such  that  ||v"-  v*||  <  /8"A/for  all  n. 
The  sequence  converges  geometrically  at  the"  rat f  P  if  it  converges  geometrically  at  the  rate  y  for 
all  y  >  P.  The  subradius  of  P  is  denoted  by  p  *(P). 

3.  COMPARISON  OF  PRE-JACOBI  AND  GAUSS-SEIDEL  ITERATION  FOR 

MARKOV  CHAIN  PROBLEMS 

It  is  well  known  that  if  Pre-Jacobi  iteration  starts  with  v1  >  v°  (so  that  vn+1  ^ 
v*  ^  ...  >  v°)  then  Gauss-Seidel  iteration  yields  v"  >  v"  for  all  n ,  where  |v")  denotes  the 
sequence  generated  by  Gauss-Seidel.  A  common  interpretation  of  this  result  is  that  Gauss- 
Seidel  is  faster  than  Pre-Jacobi.  We  shall  argue  that  this  interpretation  is  misleading  at  best. 
Starting  with  a  specified  e  >  0,  the  object  of  an  iterative  method  here  is  to  obtain  an  estimate 
v„t  of  v*  such  that  llvesl-  v*||/||v*||  <  e  is  guaranteed.  (We  used  e  -  10"6  in  all  of  the 
numerical  work  discussed  in  this  paper.)  A  stopping  approach  consists  of  a  technique  for  con¬ 
structing  ves,  from  the  data  generated  by  the  method  and  a  rule  for  stopping  the  computations 
that  guarantees  llvesl-  v*||/||v*||  <  «. 

There  are  two  important  alternative  approaches  that  we  wiP  discuss.  The  first  uses  v”  as 
the  estimate  of  v*  and  stops  when  ||v"-  v"-1  i |/| !v"| |  ^  e(l  -a)/[a(l  +«)].  This  rule 
derives  from  the  classical  bound 

llv"  -  v*||  <  ll/MI  •  ||v*-  V-MI/U  -  IIPII), 

since  ||/*||  —  a. 

The  second  approach  selects 

Vest"  v*  +  a(a„  +  b„)/[2(l  -  a)], 

where  an :  -  min  (v*—  v"-1)  and  b„ max  (v,*-  v"-1)  and  stops  when  b„  -  a„  <  2«(1  -  a) 

llveJI/laO  +  e)].  This  rule  derives  from  the  bounds  of  Porteus  [24],  It  improves  on  the  first 
because  b„  -  a„  <  2| | v"  -  v"_1||  always  holds,  often  with  a  substantial  difference  since 
v"  >  v"-1  means  that  a„  >  0. 

At  this  point,  one  might  well  ask  what  the  above  has  to  do  with  a  comparison  between 
Pre-Jacobi  and  Gauss-Seidel.  If  the  first  stopping  approach  is  used,  then  clearly 
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1 1  v"  —  v  *  1 1  ^  1 1  v"  —  v  *  1 1,  giving  favor  to  Gauss-Seidei.  However,  if  the  second  approach  is 
used,  then  favor  can  swing  to  Pre-Jacobi.  For  Markov  chain  problems,  Pre-Jacobi  is  equivalent 
to  relative  value  iteration,  which  converges  geometrically  at  the  "rate"  p*(P).  (The  equivalence 
holds  only  when  the  second  stopping  approach  is  used.)  Gauss-Seidei  converges  geometrically 
at  the  "rate"  p(P).  where  P  denotes  the  implied  transition  matrix.  Thus,  if  p*(P)  <  p{P), 
which  is  very  possible,  Pre-Jacobi  has  a  better  guaranteed  speed  of  convergence  than  does 
Gauss-Seidei. 

To  assess  the  extent  to  which  the  observations  above  are  borne  out  in  examples,  we 
applied  both  Pre-Jacobi  and  Gauss-Seidei  with  each  of  the  stopping  approaches  mentioned  to 
the  twenty-seven  problems  generated.  For  Gauss-Seidei,  the  second  stopping  approach  used 
the  appropriate  specialization  of  the  results  in  Porteus  [24):  v"  >  v""1  implies  that  the  selection 
is 

vest  ”  v"  +  .5  [aaj(\  -  a)  +  abj(\  —5)1. 

which  guarantees  that  ||vesl  —  v * 1 1  ^  ,5[aii„/(l  -  a)  -  aa„/(l  -  a)].  The  trivial  scalar  extra¬ 
polation  was  used  after  each  iteration  of  Pre-Jacobi  to  reduce  roundoff  errors.  Doing  so  has  no 
effect  on  the  results  in  the  absence  of  roundoff  errors.  However,  we  could  not  use  that  extra¬ 
polation  when  applying  Gauss-Seidei  because  it  affects  the  results.  Indeed,  it  can  yield  a  diver¬ 
gent  method.  Provided  the  minimum  and  maximum  row  sums  of  the  Gauss-Seidei  matrix  are 
not  too  close  (which  was  the  case  in  all  of  our  examples),  the  effect  of  roundoff  errors  is 
significantly  smaller  than  the  effect  when  Pre-Jacobi  is  applied.  Thus,  the  comparison  is  not 
significantly  affected  by  the  fact  that  measures  were  taken  to  control  the  effect  of  roundoff 
errors  for  Pre-Jacobi  but  none  were  for  Gauss-Seidei. 

The  results  are  summarized  in  Table  1.  Some  of  the  detailed  results  appear  later  in  Table 
4,  but  comprehensive  details  are  omitted  here  because  there  was  so  little  variation  in  the  results 
within  each  method.  The  results  show  that  when  the  first  stopping  approach  is  used,  Gauss- 
Seidei  is  preferred  to  Pre-Jacobi,  but  that  the  opposite  is  the  case  with  the  second  approach. 
Indeed,  under  the  second  approach,  Pre-Jacobi  always  required  fewer  iterations  with  the 
differences  ranging  from  6,  14,  and  17  up  to  48.  Incidentally,  using  Gauss-Seidei,  the  second 
stopping  approach  always  required  4  or  5  fewer  iterations  than  the  first. 

TABLE  1  —  Number  of  Iterations  Required  by  Pre-Jacobi  Iteration  and 
Gauss-Seidei  Iteration  Using  Two  Alternative  Stopping  Approaches, 
for  Twenty-Seven  Problems 


Stopping 

Approach 

Iterative 

Method 

Number  of  Iterations 

Minimum 

Median 

Maximum 

First 

PJ 

112 

123 

128 

GS 

60 

66 

83 

Second 

PJ 

15 

32 

72 

GS 

56 

62 

78 

To  see  that  the  differences  can  become  more  pronounced  when  the  row-sums  are  close  to 
one,  the  probabilities  in  the  nine  problems  with  50  rows  each  were  scaled  up  so  that  their  row- 
sums  changed  from  .9  to  .99,  and  the  four  combinations  were  applied.  Pre-Jacobi  with  the 
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second  stopping  approach  required  approximately  twice  as  many  iterations  whereas,  not  surpris¬ 
ingly,  the  other  three  combinations  required  more  than  ten  times  as  many.  The  second  stop¬ 
ping  approach  will  be  used  exclusively,  henceforth. 

4.  THE  EFFECT  OF  REORDERING  AND  EXTRAPOLATIONS 

The  previous  section  gives  examples  in  which  Pre-Jacobi  is  preferred  to  Gauss-Seidel. 
However,  no  reorderings  or  extrapolations  were  carried  out  in  these  examples  (except  that 
which  was  done  to  control  roundoff  error).  In  Porteus  and  Totten  [25]  and  Porteus  [221,  evi¬ 
dence  was  given  that  both  reordering  and  extrapolating  can  have  a  beneficial  effect  when 
Gauss-Seidel  is  used.  Therefore,  both  of  these  modifications  were  applied  to  the  twenty-seven 
test  problems.  The  minimum  remaining  row-sum  (reordering)  method  and  the  row-sum  extra¬ 
polation  were  used.  Both  were  introduced  in  Porteus  [221,  where  they  outperformed  their 
identified  competitors  on  the  numerical  examples  considered.  The  numerical  results  here  are 
summarized  in  Table  2.  To  attempt  to  make  the  comparisons  between  methods  fair,  the  results 
are  reported  in  terms  of  normalized  (Pre-Jacobi)  iterations,  which  account  for  the  additional 
multiplications  and  divisions  needed  (such  as  for  the  extrapolations).  Some  of  the  detailed 
results  appear  in  Table  4. 


TABLE  2  —  Median  Number  of  Normalized  Iterations  Required  by  Gauss- 
Seidel  with  and  without  Reordering  and/or  Extrapolating 


Iterative 

Method 

Reordering? 

Extrapolating? 

Number  of  Rows 

50 

100 

200 

Overall 

PJ 

40 

30 

32 

63 

62 

60 

■it 

43 

36 

35 

36 

GS 

Hal' 

■l^v 

30.7 

29.6 

29.3 

30.6 

GS 

Yes 

17.8 

18.4 

20.3 

18.5 

’Except  for  control  of  roundoff  error,  as  previously  indicated. 


The  results  show  that  both  reordering  and  extrapolating  provide  beneficial  effects  on 
Gauss-Seidel  for  the  examples  considered  Indeed,  when  both  modifications  were  used, 
Gauss-Seidel  outperformed  Pre-Jacobi  in  every  instance,  reversing  the  order  of  preference 
observed  in  the  previous  section  This  preference,  for  Gauss-Seidel  (with  reordering  and  extra¬ 
polating)  over  Pre-Jacobi,  appears  to  be  valid  even  when  the  row-sums  are  very  close  to  one,  as 
it  was  observed  in  the  numerical  examples  in  Porteus  [221  in  which  row-sums  were  .9,  .99, 
.999,  and  .9999 

5.  THE  IMPLIED  EQUAL  ROW  SUM  TRANSFORMATION 

This  transformation  converts  a  semi-Markov  chain  problem  (with  unequal  row-sums)  into 
an  equivalent  Markov  chain  problem  (with  equal  row-sums).  It  was  introduced  by  Schweitzer 
[27],  discussed  by  Porteus  [211  and  by  van  Nunen  and  Stidham  [321  with  a  computational  con¬ 
text  in  mind,  applied  by  Lippman  [15]  to  queuing  optimization  models,  and  studied  by  Serfozo 
[29]  with  some  general  Markov  decision  processes  in  mind. 
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It  is  called  the  implied  equal  row-sum  transformation  here  because,  computationally,  it  is 
desirable  to  implement  it  in  the  following  manner.  Assume  without  loss  of  generality  that 
P„  =  0  for  all  /.  Let  w,  :  =  (1  -  a)/(l  —  a,),  where  a,  and  a  are  as  defined  in  Section  2.  The 
resulting  iterative  method  (corresponding  to  applying  Pre-Jacobi  in  conjunction  with  the 
transformation)  is  as  follows: 

v"  =  r  +  Pv"-' 

v,"  =  h^v"  +  (1  —  w,)  v"-1  for  all  i. 

The  implied  row  sums  are  all  equal  to  a. 

If  one  starts  with  an  unequal  row-sum  problem,  then  Pre- Jacobi  will  converge  at  the  "rate" 
of  the  radius  (of  P)  whereas  if  the  implied  equal  row-sum  transformation  is  applied,  yielding  an 
induced  transition  matrix  P,  then  Pre-Jacobi  will  converge  at  the  "rate"  of  the  subradius  of  P 
Thus,  it  is  not  necessarily  clear  that  this  method  will  converge  faster  than  Pre-Jacobi  applied  to 
the  original  problem. 

To  observe  the  potential  of  this  method,  it  was  applied  to  two  sets  of  twenty-seven  semi- 
Markov  chain  problems.  Both  sets  were  derived  from  the  original  twenty-seven  Markov  chain 
problems.  Each  time  Gauss-Seidel  is  applied  to  a  Markov  chain  problem  using  a  particular  ord¬ 
ering  of  the  states,  there  is  an  induced  transition  matrix  with  unequal  row-sums.  Such  a  transi¬ 
tion  matrix  can  be  interpreted  as  representing  a  semi-Markov  chain  problem.  The  method  dis¬ 
cussed  in  this  section  is  then  applied  to  that  problem.  The  first  set  of  semi-Markov  chain  prob¬ 
lems  derived  from  applying  Gauss-Seidel  to  the  original  matrices  and  the  second  set  from 
applying  Gauss-Seidel  to  the  reordered  matrices.  In  the  first  set,  the  implied  equal  row-sum 
(IERS)  method  reduced  the  median  number  of  normalized  iterations  from  62  to  47.1.  How¬ 
ever,  the  figures  are  36  and  64.9,  respectively,  for  the  second  set.  A  partial  explanation  of  this 
reversal  in  performance  may  involve  a  -  a,  the  difference  between  the  maximum  and 
minimum  row  sums.  When  this  difference  is  large,  IERS  entails  large  implied  diagonal  entries, 
which  correspond  to  a  large  subradius.  In  the  first  set  of  problems,  this  difference  was  rela¬ 
tively  small,  with  a  median  value  of  .19,  whereas  it  was  larger  in  the  second  set  of  problems, 
where  the  median  was  .31.  Thus,  it  seems  doubtful  that  IERS  will  yield  improved  results  in 
general.  Indeed,  when  the  row  sum  extrapolation  was  used  instead  of  IERS,  the  median 
number  of  normalized  iterations  was  reduced  from  47.1  to  30.6  for  the  first  set  of  problems  and 
from  64.9  to  18.5  for  the  second.  However,  IERS  may  yield  improved  results  in  problems  in 
which  a  —  a  is  quite  small.  Furthermore,  in  semi-Markov  decision  problems,  it  may  well  be 
valuable  in  selecting  policies  during  the  early  stages  of  an  algorithm,  since  it  equalizes  transition 
times  and  reduces  the  sensitivity  of  the  policy  improvement  step  to  the  estimate  of  the  optimal 
return  function. 

6.  REVERSIBLE  GAUSS-SEIDEL 

Reversible  Gauss-Seidel  (RGS)  is  simply  an  analogous  method  to  what  is  called  SSOR 
(symmetric  successive  overrelaxation)  in  the  numerical  analysis  literature  (see  Varga  [33]  or 
Young  [37],  for  example).  It  consists  of  using  Gauss-Seidel  in  a  specified  order  during  the  first 
pass  of  an  iteration  and  in  the  reverse  order  during  the  second  pass.  An  important  advantage  of 
this  method,  as  presented  by  Conrad  and  Wallach  [I],  is  that  with  modest  additional  storage,  a 
substantial  number  of  operations  can  be  saved.  For  instance,  if  the  original  matrix  has  an  equal 
number  of  nonzero  entries  in  its  lower  and  upper  triangular  parts,  then  the  second  pass  requires 
only  about  one-half  of  a  normalized  iteration.  Furthermore,  if  no  extrapolation  is  made  after 
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an  iteration,  the  first  pass  of  the  subsequent  iteration  also  requires  only  about  one-half  of  a  nor¬ 
malized  iteration.  (If  an  extrapolation  is  made,  then  the  first  pass  of  the  next  iteration  requires 
a  full  normalized  iteration.) 

When  the  row  sum  extrapolation  was  used,  RGS  required  a  median  of  24.3  normalized 
iterations  over  the  standard  twenty-seven  problems  and  16.7  when  the  matrix  was  reordered. 
Note  that  each  iteration  requires  roughly  Vk  normalized  iterations,  because  an  extrapolation  is 
made  after  each  iteration.  If  an  extrapolation  is  likely  to  have  little  effect  on  the  current  iterate, 
it  might  be  useful  to  skip  the  extrapolation,  so  that  the  next  iteration  would  require  merely  one 
normalized  iteration.  A  modification  based  on  this  idea  was  tried.  Let  v"  denote  the  mh  iterate 
and  vn+l  the  result  after  applying  the  two  passes  of  the  iteration,  before  any  extrapolation.  The 
rule  used  was:  if 

min  (v,"  +  1  -  v")  <  0  <  max  (v"+l  -  v,"), 

i  i 

then  do  not  extrapolate.  This  modification  brought  the  median  number  of  normalized  iterations 
down  to  19.3,  and  to  15.5  for  the  reordered  matrices,  which  represented  the  best  performance 
obtained  over  all  methods  tested  during  this  study.  Detailed  results  appear  in  Table  4. 

7.  COMPARISON  OF  ITERATIVE  METHODS  AND  GAUSSIAN  ELIMINATION 

ft  is  well  known  that  iterative  (indirect)  methods,  such  as  Pre-Jacobi  iteration,  take  less 
computational  effort  to  obtain  a  practical  solution  to  our  problem  than  do  direct  methods,  such 
as  Gaussian  elimination,  as  long  as  the  number  of  rows  is  sufficiently  large,  provided  that  P  is 
fully  dense  ( pu  >  0  for  all  /  and  j).  It  is  plausible  to  speculate  that  the  result  is  true  when  P  is 
sparse  as  well.  Our  numerical  results  will  support  this  claim.  Indeed,  they  will  give  insight  into 
the  approximate  combination  of  problem  size  and  density  for  which  an  iterative  method  is  pre¬ 
ferred  to  Gaussian  elimination. 

The  effectiveness  of  Gaussian  elimination  is  significantly  affected  by  the  rule  used  for 
selecting  the  pivot  elements.  A  rule  which  "has  proved  to  be  satisfactory  over  a  very  wide 
range  of  problems,"  according  to  Reid  (261,  is  the  one  proposed  by  Markowitz  (17).  It  itera¬ 
tively  selects  as  the  next  pivot  element  one  that  satisfies  a  numerical  stability  criterion  and 
minimizes  the  product  of  nonzeroes  in  its  column  (over  other  unanalyzed  rows)  and  nonzeroes 
in  its  row  (over  other  unanalyzed  columns).  In  general,  all  nonzeroes  in  the  unanalvzed  subma¬ 
trix  (the  matrix  composed  of  unanalyzed  rows  and  columns)  are  considered  as  possible  pivot 
elements.  In  each  of  the  problems  generated  here,  the  original  matrix  /-Pis  a  diagonally  dom¬ 
inant  M-matrix  (see  Varga  (33)  and  Young  137]  for  convenient  discussions  of  A/-matrices  in  the 
context  of  iterative  methods).  A  result  by  Fan  [3]  and  an  observation  by  Meijerink  and  van  der 
Vorst  [18]  guarantee  that  after  pivoting  on  any  diagonal  element  of  a  diagonally  dominant  M- 
matrix,  the  remaining  unanalyzed  submatrix  will  also  be  a  diagonally  dominant  A/-matrix. 
Thus,  only  diagonal  entries  were  allowed  to  be  pivot  elements  in  our  numerical  work.  This  rule 
guarantees  that  numerical  stability  will  be  maintained  regardless  of  which  diagonal  element  is 
selected  as  the  pivot  element  and  eliminates  the  need  to  compute  the  indicated  product  of 
nonzeroes  for  potential  pivot  elements  except  for  the  diagonal  elements. 

The  object  of  the  numerical  work  in  this  section  is  to  compare  Gaussian  elimination  with 
the  best  found  iterative  method,  which  was  reversible  Gauss-Seidel  with  the  matrix  reordered 
and  using  the  row  sum  extrapolation  occasionally,  as  specified  by  the  rule  discussed  in  the  pre¬ 
vious  section.  For  convenience,  we  refer  to  this  method  simply  as  RGS  in  this  section. 
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With  the  accuracy  criterion  used  to  stop  RGS  (see  Section  3)  and  the  computer  used, 
both  RGS  and  Gaussian  elimination  yielded  results  of  comparable  accuracy  To  be  conservative 
on  the  side  of  favoring  Gaussian  elimination,  the  multiplications  needed  to  select  the  pivot  ele¬ 
ments  were  not  counted.  Furthermore,  we  assumed  that  a  data  structure  was  used  with  Gaus¬ 
sian  elimination  that  required  multiplications  to  be  carried  out  only  when  two  nonzeroes  were 
to  be  multiplied  together.  This  assumption  also  favors  Gaussian  elimination. 

The  results  appear  in  Table  3.  The  first  two  columns  indicate  parameter,  values  used  to 
generate  the  problems:  the  maximum  probability  size  and  the  maximum  number  of  nonzero 
probabilities  in  each  row.  The  third  column  refers  to  the  average  number  of  nonzero  probabili¬ 
ties  in  each  row  (averaged  over  the  rows  for  a  given  problem).  The  range  shown  corresponds 
to  the  minimum  and  maximum  such  averages  over  the  three  problems  represented  in  that  row. 
Before  discussing  the  results,  it  is  worth  mentioning  that  the  Markowitz  ordering  gave  substan¬ 
tially  better  results  for  Gaussian  elimination  than  was  obtained  with  either  the  original,  essen¬ 
tially  random,  ordering  or  the  ordering  used  by  RGS.  Indeed,  use  of  the  original  ordering 
required  anywhere  from  1.6  to  7.3  times  as  much  work,  with  an  average  of  4.5,  as  the  Mar¬ 
kowitz  ordering. 


TABLE  3  —  Number  of  Normalized  Iterations  Required  by  RGS  and  Gaussian 

Elimination  (GE) 


Number  of  Nonzero 
Probabilities  per  Row 

Number  of  Rows 

Maximum 

Maximum 

Actual 

50 

100 

200 

Probability 

Range 

RGS 

GE 

RGS 

GE 

RGS 

GE 

3 

2.4-2. 5 

mzm 

5.5 

15.6 

Kza 

18.1 

29.3 

3 

2.7 

139 

6.4 

13.8 

13.1 

17.1 

41.9 

5 

2. 6-2. 7 

HU 

8.9 

15.6 

12.8 

23.1 

43.5 

2. 7-2. 8 

16.6 

14.2 

16.2 

17.5 

37.4 

3 

2.9 

15.7 

6.7 

15.7 

17.6 

15.6 

54.4 

.63 

5 

3. 4-3. 5 

15.5 

11.1 

12.1 

14.6 

78.1 

.54 

10 

3.9-4. 1 

13.1 

16.1 

13.4 

13.5 

101.8 

.36 

5 

4. 7-4. 8 

11.5 

19.7 

12.8 

52.2 

12.6 

151.4 

.18 

10 

95-9.6 

31.3 

10.9 

81.8 

11.0 

268 

The  results  support  the  claim  that  for  problems  of  a  given  density  (number  of  nonzero 
elements  per  row)  an  iterative  method  (RGS  in  this  case)  will  outperform  GE  for  problems 
with  sufficiently  many  rows.  Indeed,  they  support  the  hypothesis  that,  for  fixed  density,  the 
number  of  normalized  iterations  (total  number  of  operations,  respectively)  required  by  RGS  is 
a  constant  (linear,  respectively)  fu..  cion  of  the  number  of  rows.  Similarly,  for  Gaussian  elimi¬ 
nation,  the  supported  relationships  are  quadratic  and  cubic  functions  for  normalized  iterations 
and  total  work,  respectively. 

The  numerical  results  also  support  the  hypothesis  that  for  problems  of  a  given  size 
(number  of  rows),  an  iterative  method  will  outperform  GE  for  problems  with  sufficiently  high 
density.  For  Gaussian  elimination,  higher  density  is  apt  to  imply  a  larger  number  of  fill-ins 
(newly  created  nonzeroes)  of  the  unanalyzed  submatrices  as  the  partial  pivoting  is  carried  out. 
Such  fill-ins  tend  to  compound  themselves  and  cause  the  number  of  multiplications  required  to 
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TABLE  4  —  Number  of  Normalized  Iterations  Required  by  Various  Algorithms  on  Each  Problem 


Problem 


Algorithm 


NR 

NNZ 

MAXP 

GE  PJ 

GS11 

GS21 

GS12  GS22 

RG12 

RGI3 

RG22 

RG23  Gill 

0121 

50 

3 

9 

5.5  72 

78 

46 

37  2 

28  9 

28.7 

20.6 

24  8 

19.4 

66,9 

62  7 

100 

3 

9 

9  6  36 

62 

34 

39  3 

23.7 

294 

23  7 

17.7 

15  6 

51  0 

76.5 

200 

3 

9 

29  3  42 

62 

29 

35.9 

23.4 

34  4 

24.5 

23  6 

18  1 

58.7 

88  0 

50 

3 

75 

6  4  52 

74 

41 

340 

23  0 

26.8 

206 

194 

16  6 

57  8 

61  9 

100 

3 

75 

13.1  33 

61 

32 

35.3 

18  9 

24  7 

22  3 

16.4 

13.8 

48.1 

78.3 

200 

3 

75 

41.9  36 

62 

35 

31  1 

229 

26.5 

21,3 

21.9 

17.1 

49  3 

71.3 

50 

5 

9 

8  9  42 

56 

33 

42  5 

23  2 

28.3 

21.4 

22  6 

18  9 

66.5 

85  8 

100 

5 

9 

12  8  49 

67 

32 

39  8 

24  5 

31.7 

21  4 

18  3 

15,6 

609 

88  6 

200 

5 

o 

43  5  40 

57 

26 

40  8 

27.1 

37.3 

26.1 

32  1 

23  1 

57.6 

90  5 

50 

10 

10  0  46 

65 

38 

31.2 

16  1 

28.5 

19  6 

17.0 

16.6 

56.3 

75.5 

100 

10 

* 

16  2  41 

63 

31 

29.6 

17  4 

36  1 

26  4 

17.0 

14  2 

50.5 

76  4 

200 

10 

9 

37  4  36 

59 

26 

35  4 

20.3 

26  6 

21.5 

18  1 

17.5 

49  5 

90  8 

50 

3 

.6 

6  7  40 

70 

43 

30.6 

21.2 

24.5 

19.3 

19.5 

15.7 

48.5 

52.5 

100 

3 

6 

17  6  30 

61 

36 

32  0 

18  5 

22  5 

19.2 

16.7 

15.7 

45.8 

71.5 

200 

3 

9 

54  4  32 

62 

36 

29.3 

18.5 

24  3 

20  2 

18.3 

15  6 

47  1 

68  7 

50 

5 

63 

11  1  30 

60 

37 

30  7 

17.8 

21.6 

17.7 

16.4 

15.5 

42  6 

62  0 

100 

5 

63 

29  5  29 

63 

36 

25.7 

16.6 

19.6 

16  8 

14.5 

12.1 

44  1 

64  9 

200 

s 

63 

78  1  28 

57 

36 

24.2 

203 

21  5 

17.8 

16.5 

14  6 

43  8 

63  1 

50 

10 

54 

16  1  25 

63 

44 

22.2 

14.7 

17  9 

16  6 

12.7 

13  1 

37.5 

54.9 

100 

10 

54 

35  9  25 

62 

38 

25.9 

18.4 

17.8 

16.5 

13,0 

13  4 

36.1 

62  3 

200 

10 

54 

101  8  25 

60 

35 

26.1 

17.3 

19  6 

17  6 

143 

13.5 

390 

71.6 

50 

5 

36 

19  7  23 

63 

45 

18.0 

15.5 

15.7 

14  5 

12  7 

11.5 

33  9 

460 

100 

5 

36 

52  2  22 

62 

43 

18  0 

16  8 

17.1 

14  4 

13.0 

12  8 

364 

47.3 

2(H) 

5 

36 

1514  22 

58 

38 

20.3 

17.9 

17.4 

15.5 

12.7 

126 

38  7 

53.2 

50 

10 

18 

31  3  15 

63 

51 

14.3 

10.9 

13.2 

120 

11.0 

110 

276 

343 

100 

10 

18 

81  8  15 

62 

47 

16  5 

12.0 

13.2 

13  2 

10.9 

10  9 

29  8 

409 

200 

10 

18 

2680  16 

61 

47 

15.4 

14  3 

13.3 

14.3 

110 

110 

27.6 

39  8 

Median 

29  6  32 

62 

36 

306 

18  5 

24.3 

19.3 

16.7 

15  5 

47  1 

649 

Mean 

44  0  33  4 

62.7 

376 

28.9 

193 

23.6 

19 

17  1 

15.0 

46  4 

659 

Symbols  used  in  table: 


NR  Number  of  rows 

NNZ  Maximum  number  of  nonzero  elements  per  row 

MAXP  Maximum  probability  in  each  row 

GE  Gaussian  elimination,  using  Markowitz  ordering 

PJ  Pre-Jacobi  using  trivial  scalar  extrapolation  for  roundoff  error  control 

GS1 1  Gauss-Seidel,  using  original  ordering  and  no  extrapolations 

GS2I  Gauss-Seidel,  using  reordered  matrix  and  no  extrapolations 

GS12  Gauss-Seidel,  using  original  ordering  and  row  sum  extrapolation 

GS22  Gauss-Seidel,  using  reordered  matrix  and  row  sum  extrapolation 

RG12  Reversible  Gauss-Seidel,  using  original  ordering  and  row  sum  extrapolation 

RG13  Reversible  Gauss-Seidel,  using  original  ordering  and  modified  row  sum  extrapolation 

RG22  Reversible  Gauss-Seidel,  using  reordered  matrix  and  row  sum  extrapolation 

RG23  Reversible  Gauss-Seidel,  using  reordered  matrix  and  modified  row  sum  extrapolation 

Gill  IERS  after  Gauss-Seidel  on  original  ordering,  using  trivial  scalar  extrapolation  for 
roundoff  error  control 

GI21  IERS  after  Gauss-Seidel  on  reordered  matrix,  using  trivial  scalar  extrapolation  for 
roundoff  error  control 
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increase  significantly.  On  the  other  hand,  if  anything,  RGS  appears  to  require  fewer  normalized 
iterations  as  the  density  increases.  An  explanation  of  this  observation  may  have  something  to 
do  with  the  following.  RGS  appears  to  converge  at  least  as  fast  as  Pre-Jacobi  which  (when 
applied  to  Markov  chain  problems)  converges  at  the  "rate”  of  the  subradius  of  the  transition 
matrix.  Morton  and  Wecker  [20]  discuss  some  upper  bounds  on  this  subradius.  These  are 
called  «-Hajnal  measures  because  they  are  based  on  the  work  of  Hajnal  [9,  10].  These  upper 
bounds  are  likely  to  decrease  as  the  density  is  increased,  for  problems  of  the  type  generated 
here. 

8.  CONCLUSION 

This  paper  has  tried  to  provide  some  insights  into  the  questions  of  which  iterative 
methods  hold  the  most  promise  for  computirg  the  expected  discounted  return  in  finite  Markov 
and  semi-Markov  chains  and  when  iterative  methods  are  preferred  to  Gaussian  elimination. 
For  the  first  question,  the  results  are  inconclusive  for  two  reasons.  One,  not  all,  competing 
methods  were  tested.  For  example,  Verkhovsky’s  [35]  method  was  not  tested.  Two,  specific 
rules  were  used  to  randomly  generate  the  twenty-seven  problems  used  to  test  the  methods.  It 
is  unclear  that  the  results  found  here  will  apply  to  other  problems.  Nevertheless,  there  were 
some  clear  observations  that  applied  to  the  problems  that  were  generated.  Gauss-Seidel  is  not 
preferred  to  Pre-Jacobi  for  Markov  chain  problems  in  general,  although  it  is  preferred  if  the 
matrix  is  reordered  and  the  row-sum  extrapolation  is  used.  The  implied  equal  row-sum 
transformation  did  not  yield  improved  results.  Reversible  Gauss-Seidel  did  yield  slightly 
improved  performance,  provided  that  the  matrix  was  reordered  and  the  row-sum  extrapolation 
was  used.  A  further  slight  improvement  was  obtained  when  a  rule  for  only  applying  the  extra¬ 
polations  occasionally  was  used.  Whether  these  improvements  are  worth  the  extra  coding 
needed  to  implement  them  remains  to  be  seen.  However,  it  appears  that  iterative  methods  are 
preferred  to  Gaussian  elimination  for  some  problems  with  as  few  as  50  rows  and  for  many 
problems  that  have  200  rows  or  more. 

APPENDIX:  PROBLEM  GENERATION 

All  twenty-seven  problems  generated  were  Markov  chain  problems  with  a  discount  factor 
of  .9  (a  =  a  «*  .9).  Three  other  parameters  were  used  to  generate  the  problems.  Their  values 
for  each  problem  are  shown  in  Table  4.  NR  is  simply  the  number  of  rows.  NNZ  and  MAXP 
were  used  to  generate  the  probabilities  in  a  row. 

The  locations  of  the  nonzero  probabilities  in  each  row  were  generated  randomly.  Working 
sequentially,  a  location  was  chosen  from  a  discrete  uniform  distribution  over  the  positions  not 
yet  having  a  nonzero  element.  The  values  of  the  elements  were  generated  from  a  discrete  uni¬ 
form  distribution  on  the  interval  (.001,  q]  where  q  :  —  min  (MAXP, REM),  and  REM  is  the 
discount  factor  less  the  cumulative  sum  of  probabilities  already  generated  in  that  row.  If  the 
resulting  cumulative  row  sum  did  not  equal  the  discount  factor  and  the  maximum  number  of 
nonzero  elements  (NNZ)  had  not  yet  been  attained  for  that  row,  then  another  location  was 
generated.  Because  a  minimum  element  size  (equal  to  .001)  was  used,  the  maximum  number 
of  nonzero  elements  was  not  always  attained.  The  last  nonzero  element  generated  in  each  row 
was  adjusted  so  that  the  row  sum  would  equal  the  discount  factor. 

The  immediate  returns  (making  up  the  vector  r)  were  generated  as  follows.  The  first  ten 
were  set  equal  to  lOn,  where  n  denotes  the  row  number.  The  last  was  set  equal  to  zero,  and 
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the  remaining  ones  were  generated  from  a  continuous  uniform  distribution  on  the  interval  (0, 
1).  The  rationale  behind  this  rule  is  outlined  in  Porteus  [221 
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ABSTRACT 

We  investigate  a  two-echelon  (base-depot)  inventory  system  of  recoverable 
(repairable)  items.  The  arrivals  of  demand  at  the  bases  are  in  a  Poisson 
manner  and  the  order  sizes  are  random.  The  failed  units  can  be  repaired  either 
at  the  base  or  at  the  depot,  and  the  units  beyond  economic  repair  are  con¬ 
demned.  Inspection  of  the  failed  units  is  carried  out  in  the  batches  they  arrive, 
that  is,  arrival  batches  are  not  broken  up.  The  exact  expressions  for  stationary 
distribution  of  depot  inventory  position,  and  of  the  number  of  backorders,  on- 
hand  inventory,  in-repair  inventory  at  all  locations  are  derived  under  the  as¬ 
sumptions  of  constant  repair  and  lead  times.  Special  cases  of  complete  recover¬ 
ability,  nonrecoverability.  and  of  the  unit  order  size  are  also  discussed. 


I.  SYSTEM 

Consider  a  two-echelon  inventory  system  consisting  of  a  set  of  bases  (lower  echelon)  and 
a  central  depot  (upper  echelon).  Each  location,  in  addition  to  being  an  item  stocking  point,  has 
facilities  to  perform  repairs.  Figure  1  shows  the  schematic  diagram  of  the  system  under  con¬ 
sideration.  The  system  demands  are  generated  at  bases.  Customers  while  placing  requisitions 
for  certain  number  of  items,  turn  in  a  like  number  of  failed  units.  The  item  stocked  in  this 
system  is  recoverable.  Upon  failures,  units  are  returned  to  base  where  a  decision  is  made  e  ther 
to  remove  (condemn)  the  units  from  the  system  or  to  perform  repairs  on  them  either  at  the 
base  itself  or  at  the  depot  in  order  to  restore  the  units  to  a  serviceable  condition.  The  decision 
to  repair  or  to  condemn  the  failed  units  is  based  on  the  degree  and  the  nature  of  failure,  the 
repair  facilities  available,  and  the  economics  involved.  Once  an  item  is  designated  as  recover¬ 
able,  it  is  presumably  more  economical  to  repair  the  item  than  it  is  to  dispose  of  it  and  replace 
it  with  a  new  item. 

The  system  is  further  characterized  by  the  following  descriptors: 

(1)  Arrival  Pattern  and  Order  Size.  The  failures  which  generate  the  system  demands 
occur  in  a  Poisson  manner  with  known  and  constant  rate.  Upon  such  failures,  a  random 
number  of  units  is  demanded  for  replacement. 

(2)  Resupply  at  Bases  and  Depot.  The  bases  are  resupplied  as  necessary  only  by  the 
depot.  That  is,  lateral  supply  among  the  bases  is  not  allowed.  The  procurement  of  items  from 
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Figiri  1.  Two-echelon  inventory  system  for  recoverable  items 

the  external  supplier  to  make  up  for  the  condemnations  i1-  made  only  by  the  depot.  Each  loca¬ 
tion,  in  addition,  receives  supplies  from  its  repair  station. 

(3)  Inspection  Policy  at  Bases.  The  inspection  of  failed  units  is  carried  out  at  the  same 
base  where  the  requisition  for  replacement  arrived  immediately  after  failure.  Two  inspection 
policies  suggest  themselves.  The  first  policy.  Batch  Inspection  Policy,  determines  if  a  batch  of 
failed  units  as  a  whole  is  either  base  repairable,  depot  repairable  or  condemnable.  From  a  prac¬ 
tical  view  point,  this  policy  represents  situations  where  the  units  of  a  batch  (or  a  module)  fail 
simultaneously  for  the  same  reason  and  the  extent  of  damage  is  the  same  for  all  units  within 
the  batch.  In  the  second  policy.  Unit  Inspection  Policy,  each  failed  unit  in  a  batch  is  inspected 
independently  to  determine  whether  the  unit  is  base  repairable,  depot  repairable  or  condemn¬ 
able.  This  policy  represents  the  situations  where  units  failed  under  difficult  conditions  but  are 
submitted  in  a  batch  for  replacement,  or  the  situations  where  decision  is  very  critical  and  calls 
for  inspection  of  each  individual  unit  in  a  batch.  In  this  paper  we  shall  consider  Batch  Inspec¬ 
tion  Policy.  It  is  further  assumed  that  the  inspection  takes  negligible  time. 

(4)  Procurement  Policies.  The  bases  use  an  (s  -  I.  s)  policy  for  procurement  of  units 
from  the  depot.  The  depot  procurement  policy  is  a  general  continuous  review  (s,  S)  policy. 
Policies  at  all  locations  are  in  terms  of  inventory  position  defined  to  be  the  sum  of  on-hand, 
on-order  and  in-repair  inventories  minus  any  backorders. 
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(5)  Backlogging.  Demands  occurring  when  a  location  is  out  of  stock  are  backlogged. 
Further,  partial  backlogging  of  demands  is  allowed.  For  example,  upon  arrival  of  a  requisition 
if  the  base  does  not  have  the  number  of  units  demanded,  then  all  the  units  on  hand  are  sup¬ 
plied  while  the  balance  is  backlogged.  The  respective  base,  however,  accepts  the  whole  batch  of 
failed  units  and  starts  the  inspection.  Similarly,  partial  backlogging  is  done  at  the  depot  for  the 
base  demands. 

(6)  Repairs.  There  are  ample  repair  facilities  at  each  location.  The  repair  times  at  the 
bases  and  at  the  depot  are  deterministic  and  independent  of  arrival  process  and  of  the  number 
of  units  in  repair.  Furthermore,  the  depot  repair  lime  is  the  same  for  all  the  units  received 
from  all  the  bases.  Repairs  at  all  locations  are  carried  out  in  a  continuous  manner— that  is.  no 
batching  is  done.  Upon  completion  of  repairs,  the  units  immediately  join  the  stock  of  service¬ 
able  items. 

(7)  Shipping  and  Lead  Times.  The  time  to  ship  a  depot-repairable  unit  to  the  depot  from 
a  base  is  assumed  to  be  negligible.  In  reality,  it  can  be  absorbed  in  the  depot  repair  time.  The 
order-and-ship  time  of  unit  from  depot  to  a  base  is  assumed  to  be  deterministic.  Similarly, 
order-and-ship  time  of  an  order  from  external  supplier  to  the  depot  is  also  deterministic. 

The  objective  function  of  the  total  expected  cost  and  several  measures  of  performance  of 
the  inventory  system  are  related  to  the  stationary  distribution  of  depot  inventory  position,  and 
number  of  backorders,  on-hand  inventory,  and  in-repair  inventory  at  different  locations.  Our 
objective  in  this  paper  is  to  determine  the  exact  expressions  for  stationary  distribution  of  these 
inventory  levels. 

A  fundamental  work  on  two-echelon  inventory  system  was  the  development  of  METRIC 
by  Sherbrooke  [7]  for  a  completely  conservative  system  that  does  not  allow  item  condemna¬ 
tions.  He  considered  the  problem  of  allocating  several  units  among  a  depot  and  several  bases  in 
order  to  minimize  the  total  expected  number  of  backorders  at  bases  within  the  limitations  of  a 
budget.  The  resulting  expressions  for  stationary  distribution  of  backorder,  however,  are 
approximate.  A  variation  of  the  METRIC  model  was  introduced  by  Simon  [8]  to  obtain  the 
exact  expressions  for  stationary  distribution  of  backorders,  allowing  condemnations.  Simon's 
analysis  was  limited  to  the  case  of  unit  order  size.  For  unit  order  size,  Muckstadt  [2,  31  has 
presented  analyses  of  the  system  where  bases  use  (s,  SI  and  (r.  Q)  ptocurement  policies. 
Shanker  [51  has  analyzed  the  case  of  random  order  size  with  condemnations;  the  condemnation 
rates,  however,  were  assumed  to  be  the  same  at  all  bases.  Our  present  analysis  is  more  general 
than  previous  works  in  that  random  order  size  and  condemnations  both  are  considered.  It. 
however,  is  less  general  than  METRIC  in  that  repair  times  here  are  assumed  deterministic. 

2.  MODEL 

We  consider  a  two-echelon  system  as  depicted  in  Figure  1  with  J  bases,  the  bases  num¬ 
bered  from  1  to  J  and  the  depot  indexed  as  0.  The  failures  which  generate  the  system  demands 
occur  in  a  Poisson  manner  with  known  rate  \j  at  base  y(y  =  1,  2,  ....  J).  Upon  such  failures 
at  base  y,  the  number  of  units  demanded,  or  equivalently,  the  number  of  failed  units  turned  in, 
has  probability  mass  function  k  ^  I,  with  finite  mean.  In  the  present  context  of  batch 

inspection  policy,  the  entire  batch  of  failed  units  is  repaired  at  the  base  with  probability  r,,  is 
shipped  to  the  depot  for  repair  with  probability  (1  -  r;)  pr  or  is  condemned  with  probability 
(1  -  r,)(  1  -  pj).  Thus  at  the  base,  the  requisitions  are  of  three  types:  base-repairable,  depot- 
repairable  and  condemnable. 
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Notation 


T, 

Pj 


(j,  -  1.  Sj) 
(so,  So) 

Rj 

tj 

TO 

NjU) 

Nfit) 

Nj>it) 

NfU) 

NjHt) 

Atf(r) 

Kit) 


—  the  rate  of  Poisson  arrival  of  requisitions  at  location  j(j  =  0,  1 . 7 ), 

=  the  probability  mass  function  of  order  size  at  location  jij  =  0,  1 . 7 ), 

=  the  probability  that  a  batch  is  repaired  at  base  j(j  =  1,  2,  ....  7), 

=  the  probability  that  a  batch  which  is  not  repairable  at  base  j  is  repaired  at  the 
depot;  (l-p7)  is,  therefore,  the  probability  of  condemning  a  base- 

nonrepairable  batch  {j  =  1,  2 . 7 ), 

=  the  procurement  policy  at  base  j(j  =  1,  2,  ....  7 ), 

=  the  procurement  policy  at  the  depot 

=  the  deterministic  repair  time  at  location  j  ij  =  0,  1 . 7), 

=  the  deterministic  delivery  time  from  the  depot  to  base  jij  =1,2 ,  7), 

=  the  deterministic  procurement  lead  time  from  the  external  supplier  to  the  depot, 
=  the  total  number  of  requisitions  that  arrive  at  location  j  during  the  interval 
(0,  r]  (7  =  0,  1 . 7), 

=  the  number  of  requisitions  at  base  j  during  the  interval  (0,  /]  for  which  the 
entire  batch  was  declared  base  repairable  ij  =  1.  2,  . . .  ,  7), 

=  the  number  of  requisitions  at  base  j  during  the  interval  (0,  rj  for  which  the 
entire  batch  was  sent  to  the  depot  for  repair  j  =  1,  2,  ....  7), 

=  the  number  of  requisitions  at  base  j  during  the  interval  (0,  r]  for  which  the 
entire  batch  was  condemned  {j  =  1,  2,  ....  7), 

=  the  total  number  of  requisitions  placed  at  the  depot  by  base  j  during  the  inter¬ 
val  (0,  /]  0  =  I,  2 . 7), 

=  the  total  number  of  requisitions  at  the  depot  during  the  interval  (0,  /]  as  a 
consequence  of  condemnations  at  the  lower  echelon, 

=  the  total  number  of  requisitions  at  the  depot  during  the  interval  (0,  r]  for 
which  the  batches  were  found  depot  repairable. 


The  variables  Dj(t),  DB(t),  Df(t ),  Df(t ),  Dfr (r)  and  Dff(t)  shall  denote  the 

number  of  units  demanded  by  the  corresponding  requisitions  N,(r),  Nf(l),  N;°(r),  Nf(i ), 
Nj°(t),  A ro(t),  and  A respectively.  For  instance,  Z>,°(r)  denotes  the  total  number  of  units 
demanded  from  the  depot  by  base  j  during  the  interval  (0,  r],  (J  =  1,  2 . 7);  and  so  on. 


<?,(') 

Z,(r) 

Bj(t) 

UjU) 

P[rt\m\ 
CPI*  I  At,  /] 


r  (•) 

Eo 

N(/„  f2) 


the  in-repair  inventory  at  time  t  at  location  j(J  =  0,  1 . 7), 

the  inventory  position  at  time  rat  location  j(j  —  0,  1 . 7), 

the  number  of  backorders  at  time  r  at  location  j  (J  =  0,  1,  ....  7)  (Negative 
backorders  indicate  on-hand  inventory), 

the  total  number  of  units  on  order  plus  in  repair  at  time  r  at  location  j(j  =  0, 

1 . J ), 

m ) n 

- ; - ,  n  *  0,  1,  ...  (Poisson  distribution  with  mean  m ), 

n\ 

T  - ; - /"Hk),  k  =  0,  1,  ...  (Compound  Poisson  distribution  with 

n-0 

parameter  A  rand  compounding  distribution  /); 

/j-fold  convolution  of  density  function  /(•), 

I s0  +  1.  Jo  +  2 . S0},  the  state  space  for  inventory  position  at  depot, 

A r(t2)  -  N(i* )  for  the  process  [Nit),  I  >  0). 
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Lower  case  letters  denote  a  particular  realization  of  a  random  variable. 

We  note  the  following  implications  of  our  assumptions: 

(a)  Obviously,  Nj(t)  =  NfU)  +  Nf(t)  +  NfU),  for  all  t  ^  0.  It  can  be  easily  seen  (see 
references  [51,  [81)  that  the  processes  l NfU).  t  >  0},  [Nf(t),  i  ^  0),  and  [NfU),  t  ^  0)  are 
mutually  independent  Poisson  processes  with  parameters  X/—  rjKt,  kf  —  (1  -  r;)(l  -  Py)Xy, 

and  x/*-  (1-  rj)pjkj%  respectively,  for  j  -  1,  2 . 7.  Consequently,  the  demand 

processes  [ t  >  0),  { Df(t ).  t  >  01  and  [Off/),  t  >  0)  are  compound  Poisson  processes 
with  parameters  X  j®,  X  f  and  X  /\  respectively,  and  have  a  common  compounding  distribution 
*,(•). 

(b)  Because  the  bases  use  an  (s  -  1,  s)  policy,  /V°(r)  =  /V/Yr)  +  Nflt).  for  all  /  >  0 

and  j  =  1,  2 . 7.  Consequently,  (N/Hr),  t  ^  0)  is  a  Poisson  process  with  parameter 

X®  —  x/  +  kf.  Furthermore,  because  the  bases  operate  independently,  [#£(/),  t ^  0}  and 

{Nft(t),  t  ^  0)  are  independent  Poisson  processes  with  parameters  X<f  =  £  XyC  and 

j  J~l 

ktf  =  £  x/\  respectively.  Moreover,  A0(r)  *■  AffO)  +  A$(r)  for  all  t  >  0,  and  thus 

,”1  y 

(iV0(t).  r  s?  0)  is  a  Poisson  process  with  parameter  X0  =  £  X®.  Again  it  can  be  shown  (see 

reference  [51)  that  the  demand  processes  {Z)0(r) ,  t  >  0),  (Dq  (r),  /  >  0)  and  { Dfi(t),  t  ^  0) 
are  compound  Poisson  processes  with  parameters  X0,  X0C  and  X(f;  and  compounding  distribu¬ 
tions  0o  (* ) ,  0<f(‘)  and  0OD('K  respectively;  where 

<M*)  -  ff  *>;<*).  0oc<*)  =  7c  I  kfrj(k) 

Xo  jwm  I  X0  y.,! 

and 

0ofl(X)  -  £  kfyjikk  for  k  =  1,  2 . 

Xo  y-| 

(c)  As  a  consequence  of  the  ?y  -  1.  Sy)  policy  at  base  j,  Zjit)  -  s,  for  all  r  >  0,  or 

equivalently,  d/,(r)  -  Bj(t)  -  for  r  ^  0.  Then  for  any  6  6  |-jy,  — Sy  +  1 . 0,  1,  . . .} 

and  for  any  t  >  t0  +  ry;  the  event  Bj(t)  -  b  occurs  if  and  only  if  Uj(t)  -  Sy  +  b. 


(d)  Because  of  the  infinite  number  of  repair  facilities  and  constant  repair  times,  the  units 
in  repair  at  base  j  at  time  t  (>/?;)  will  be  due  to  the  base  repairable  failures  occuring  only  in 
(t  -  Rj.  /];  that  is  Qj(t)  -  Df(t  -  RJt  /).  Thus  for  t  ^  Rj,  Qt(t)  is  a  compound  Poisson 
with  parameter  kfRt  and  compounding  function  0y(-).  Similarly,  Q0(t )  is  a  compound  Poisson 
with  parameter  Xo }R0  and  compounding  distribution  0^(*). 


We  first  describe  the  basic  approach  for  determining  stationary  distribution  of  system 
backorders  in  Section  3.  In  Section  4,  the  stationary  distribution  of  backorders/on-hand  inven¬ 
tory  at  bases  is  obtained.  The  results  for  in-repair  inventory  are  obtained  in  Section  5,  and 
those  for  depot  backorder/on-hand  inventory  are  given  in  Section  6.  The  special  cases  of  com¬ 
plete  recoverability,  nonrecoverability  and  unit  order  size  are  discussed  in  Section  7.  Section  8 
indicates  applicability  and  applications  of  the  results  derived  in  the  present  study. 
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3.  BASIC  APPROACH  FOR  STATIONARY  DISTRIBUTION 
OF  SYSTEM  BACKORDERS 

We  emphasize  that  the  system  backorder  means  the  backorders  at  the  bases.  Depot 
backorders  are  of  interest  only  in  so  far  as  they  affect  base  backorders.  Stationary  distribution 
of  (Z0(/),  t  ^  0|,  the  inventory  position  at  the  depot,  will  be  required  to  derive  the  expres¬ 
sions  for  stationary  distribution  of  backorders  at  bases.  The  depot,  for  this  purpose,  can  be 
treated  as  a  single  location  system  where  arrive  two  types  of  demands,  recoverable  and  nonre- 
coverable,  and  the  corresponding  processes  {Dq(i),  i  ^  0)  and  {Z>^  (r) ,  t  ^  0|  are  indepen¬ 
dent  compound  Poisson.  Further,  the  depot  inventory  position  changes  only  at  the  arrival 
epochs  of  nonrecoverable  demands  from  the  bases.  It  remains  unchanged  at  the  arrival  epochs 
of  recoverable  demands  because  in  case  of  recoverable  demands,  on-hand  inventory  decreases 
and  in-repair  inventory  increases  by  the  same  amount  with  no  change  in  inventory  position. 
Then  stationary  distribution  of  (Z0O),  /  >  0|  is  given  by  (see  Sahin  [4],  Shanker  [5),  Tijms 
[9]), 


(1)  lim  Pr{Z0(t)  =  Ar|Z0(0)  -  /)  =  %0(k)  = 

/ — »oo 


where 

m(l)  —  <f>oc(D;  M(k)  =  <t>£(k)  +  £  <t>f(k  -  q)m(q),  k  ^  2 

4-1 

and 

M(k)  =  £  m(p),  k  >  1. 

4-1 


m(SQ  -  k) 


1  +  M(S0-  s0-  1) 

i 


1  +  M(S0-  s0-  1) 


s0  +  1  <  k  <  S0  —  1 
k  =  S0 


To  find  the  stationary  distribution  B*(b)  =  lim  Pr{Bj(t )  =  b I  at  base  j,  we  first  obtain 

Pr[Bj(t)  =  6 },  or  equivalently  Pr\Uj(t)  =  s,  +  6)  for  b  €  (— ,  -sy  +  1 . 0,  1,  ...)  and 

then  evaluate  B*(-).  We  shall  follow  the  approach  suggested  by  Kruse  and  Kaplan  [I I. 

Referring  to  Figure  2,  the  only  units  that  can  arrive  at  base  j  from  the  depot  by  time  /  are 
those  on  order  by  time  t}.  This  depends  on  the  total  assets  available  at  the  depot  by  time  r3, 
the  total  demand  at  the  depot  during  the  interval  (th  /3],  and  the  sequence  of  arrivals  of 
requisitions  at  the  depot  from  bases  during  the  interval  (/,.  r3].  This  is  so  because  the  units  on 
order  through  time  it  will  have  arrived  at  the  depot  from  the  external  supplier  by  time  t}.  The 
total  assets  available  at  the  depot  by  time  t}  include  the  units  on  hand  minus  any  backorders  at 
time  r,,  the  units  on  order  at  time  ,  the  units  in  repair  at  time  t\  and  the  units  received  for 
repair  during  the  interval  (tt,  t2 ].  This  equals  z0(t |)  +  dj){tl,  t2).  Now  the  following  two 
mutually  exclusive  situations  are  possible: 

(A).  The  total  depot  demand  during  the  interval  (tt,  r2l  does  not  exceed  the  total  assets 
available  by  time  /3;  that  is, 

d$(t\.  t2 )  +  d§(t\,  i2)  <  z0(r,)  +  <#0,,  t2) 
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tl  1 

_ 1 _ 1 

J _ > _ 1 

- *0 

*  R  rj 

I ,  J 

"o  *1 

- -  ■■  -j 

r — *j— 1 

i 

Figure  2.  Time  intervals  at  base  j 


or 

4 (ft>  ^  zo('i)- 

Thus  all  the  depot  demands  <f0(r1(  ^  “  doO\*  t2)  +  4 (*t.  are  satisfied  by  time  /3.  Only 

the  depot  demands  d0(/2,  r3)  against  the  stock  of  z0(f])  ~  4 (*i<  units  available  by  time  t3 
determine  how  many  demands  could  possibly  remain  unsatisfied  by  time  t}. 

(B):  The  total  demand  during  the  interval  (/,,  r2]  exceeds  the  total  available  stock  by 
time  r3;  that  is, 

4('|.  *2)  +  4 (f|.  r2)  >  r0(r)  +  4^i-  ^ 


4^i.  f?)  > 

Thus  there  is  no  stock  available  at  the  depot  at  time  t2  to  satisfy  the  demands  dn(t2,  t}).  Also, 
there  is  no  guarantee  that  all  the  d®(tx,  demands  will  be  satisfied  since  this  depends  upon 
the  sequence  of  arrivals  of  r2)  and  t2).  Hence,  the  total  demand  d0U\,  r2) 

drawn  against  the  amount  of  z0(r,)  +  d$(t\,  t2)  determines  how  many  demands  will  remain 
unsatisfied  by  time  t}.  We  can  write 

(2)  Pr{Bj(t)  -  b)  -  Pr{UjU)  -  Sj  +  b] 


“  4 *2^*  z0(r,)  “  ro(/|))  ’  Pr{Dfi (t\,  t2) 


“  4<'h  t2)\  Zgitj)  “ 


Using  the  independence  of  t2)  and  Z0(/|)  we  can  express  Equation  (2)  corresponding  to 

the  cases  (A)  and  (B)  as  follows: 

(3)  Pr{Uj(t)  -  sj  +  il  -  T  [Pr{UjU)  -  j,  +  b)A  +  Pr{U2(t)  -  Sj  +  b)B\ 

z0(t{!tE0 

•  Pr{Z0U\)  -  z0(/|)| 


where 

(4) 


r0('|< 

Pr{Uj(t)  -  Sj  +  b)A-  £  WUjlO  ~  *j  +  » W»i.  h)  “  4('i.  f2>; 

4('l'2)_0 

Zq(/|)  “  ro(,|)) '  (f|,  l2)  ”  4 t2)|). 
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and 

(5)  Pr{Uj(t)  -  Sj  +  b)B  -  £  \[Pr  UjU)  -  s,  +  b\D$(tx.  t2) 

»  d£ (/,.  /2);  Z0(?i)  -  zo(f)))  *  /V{D£ (/,,  /2) 

“  dfi  (t\> 

For  a  given  zqU\),  Equations  (4)  and  (5)  represent  Pr[Bj{t)  -  b }  for  cases  (A)  and  (B), 
respectively.  In  the  next  section  we  shall  derive  the  expressions  for  Pr{Uj(t)  -  j;  +  b\A  and 
Pr{Uj(t)  =  Sj  +  6},s  which  upon  substitution  in  Equation  (3)  will  yield  Pr[Uj(t)  —  Sj  +  6). 

4.  STATIONARY  DISTRIBUTION  OF  BACKORDERS/ON-HAND 
INVENTORY  AT  BASES 

4.1  (A):  Pr{U,(t)~  s,  +  b}A 

To  evaluate  Pr{Uj(t)  =  s3  +  b\A  using  Equation  (4),  let 
Uj(t)  -  UjHt)  +  UjU). 

where 

UjU)  -  the  sum  of  units  in  repair  at  base  j at  time  i and  the  units  for  which  order  were 
placed  on  the  depot  by  base  j  during  the  interval  (t3,  /), 

and 

Uj(t)  =  the  units  ordered  from  the  depot  by  base  j  during  the  interval  (t2.  t}]  that 
remain  unfilled  by  time  t}. 

Because  the  arrival  process  is  Poisson,  UjU)  and  UjU)  are  independent.  As  mentioned  ear¬ 
lier,  all  the  demands  levied  on  the  depot  from  base  j  during  the  interval  (/3,  tl  will  remain  on 
order  at  time  t.  Also  the  base  repairable  demands  occurring  only  during  the  interval 
( t  -  Rj,  /]  will  be  in  the  repair  cycle  at  base  j  at  time  t.  Thus  UjU)  -  DfU  -  Rj%  t )  +  DjU}> 
t)  +  DfU3,  t),  and  the  probability  distribution  of  UjU)  can  be  easily  obtained.  The  probabil¬ 
ity  distribution  of  UjU)  requires  considering  the  sequence  of  arrivals  of  requisitions  from  the 
bases  during  the  interval  (r2*  fjl-  Once  the  probability  distributions  of  UjU)  and  UjU)  are 
obtained.  Equation  (4)  can  be  evaluated  by  taking  the  convolution  of  Uj(t)  and  UjU).  Since 
UjU)  is  independent  of  Z0(/|)  and  D£(/t,  i2),  we  can  write 

(6)  Pr[UjU)  -  sj  +  b\A  - 

Z0<(,)  Jy+6 

£  £  Pr{UjU)  -  sj  +  b-  d]  -  Pr{UjU)  -  d\D$ (/,.  tj)  -  0,.  r2); 

d£(i,,r})-0  <>-0 

Z0(/,)  -  zoOi)}J  •  Pr\D$U\,  t2)  -  <t§ U\,  f2)}. 
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As  observed  in  Section  2,  U}U)  has  a  compound  Poisson  distribution  with  parameter 
X fR,  +  (X‘  +  X,°)t,  and  compounding  distribution  </>,(•).  Therefore, 

(7)  Pr{Uj(t)  =  s,  +  b  -  d I  =  CPIs,  +  Z>  -  d\k*R,  +  <X‘ '+  X<  >r,.  <*,J. 

For  the  purpose  of  evaluating  Pr{U}(t)  =  di,  the  demands  ai  the  depot  can  be  viewed  as 
arising  from  two  sources.  One,  the  base  j  for  which  the  distribution  is  being  determined  and 
the  other  being  set  of  the  remaining  bases  (see  Figure  3).  Let  us  denote  this  set  by  <r,  that  is, 

a  =  {I,  2 . /  —  I,  j  +  I . J |.  Since  the  bases  operate  independently,  the  two  sources 

are  independent.  For  the  source  <r,  we  shall  use  the  notations  similar  to  those  used  for  an  indi¬ 
vidual  base.  Thus.  /Vr(/)  denotes  the  total  number  of  requisitions  that  arrive  at  source  <r  dur¬ 
ing  the  interval  (0,  /),  and  so  on.  The  processes  (t),  t  >  0),  (A/,(r  (/).  /  ^  0|  and  (A 
t  >  0)  are  mutually  independent  Poisson  processes  with  parameters  X,?  =  £  X,fl.  X,J  =  £  X,1 

i  f  it  if.tr 

and  X,f  =  £  x/\  respectively.  Further,  the  demand  processes  I £>,f  ( r ) ,  i  ^  0|.  (£Fr(t),  t  ^ 

f  €  tr 

0),  and  /  >  0}  are  compound  Poisson  processes  with  parameters  A*,  A,J.  and  A,,, 

respectively.  Their  respective  compounding  distributions  are: 

<»,?(*)  =  ~  £  *,**,<*>.  =  TT  I  *'*.<*>  and 

iftr  ™  tr  if  tr 

</>,?(*)  =  £  X/^,(X);  for  *  ^  I. 

A,r  i  €  tr 

Also,  the  process  { 2V ®  t ) .  t  >  0)  is  a  Poisson  process  with  parameter  X,?  =  £  X"  and  the 

/  £  «r 

demand  process  |D”(r),  /  >  0)  is  a  compound  Poisson  process  with  parameter  X,”  and  com¬ 
pounding  distribution  <£,?(X)  =  £  bf<b,(k),  for  X  >1. 

^<r  i«<t 


SOURCE  j*  BASE  j  SOURCE  <T  »  BASES  j  j-M,...  J  J 

Figure  3.  Lower  echelon  as  two  independent  sources:  /  and  it 


The  random  variable  U/it)  represents  the  units  ordered  from  the  depot  by  base  /  during 
the  interval  (t2,  /3]  that  remain  unfilled  by  time  /j.  Referring  to  Equation  (6),  we  shall  obtain 
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Pr{U}(t)  =  d\D$  fft*  li)  *  4  (/[.  /p;  Za(/()  =  z0(/,)}  by  further  conditioning  on  .V(°(r2,  /,); 

/  -  j.  a.  Let  K/l  =  /3)  -  nr  N^t(i2.  r3)  =  05'(r,,r2)  =  4  (r,,  /,);  Z()</,)  = 

Thus,  we  first  obtain  Pr{U?(t)  —  4/L4),  and  to  do  so  we  need  to  know  the  number 
of  requisitions  at  the  depot  placed  during  (r2,  r3l  and  those  completely  satisfied  by  time  ty  Let 

/V,'(r2.  fj)  =  the  number  of  requisitions  from  source  ;  that  arrived  during  (i2,  /,]  and  are 
completely  satisfied  by  time  t}\  for  /  =  j,  a. 


Suppose  we  are  given  RA  and  N‘(r2,  /})  =  n'r  Then  U;(t)  =  d  if  and  only  if  the  sum  of 
the  demands  due  to  the  unsatisfied  ~  n]  requisitions  and  unsatisfied  units  of  possibly  a  par¬ 
tially  satisfied  requisition  (if  from  base  j)  equals  d.  Let  EX  denote  the  number  of  units  sup¬ 
plied  to  the  requisition  whose  demand  is  only  partially  met.  The  range  of  the  random  variable 
EX  is  from  0  to  z0(f j)  —  4  (f|.  f2).  When  EX  =  0,  there  is  no  partially  satisfied  requisition 
and  when  EX  =*  z0(/,)  -  d{>  (th  i2),  no  requisition  is  completely  satisfied  and 
-  4  ( ti ,  r2)  units  are  supplied  to  the  first  requisition,  if  any  (see  Figure  4). 


Figure  4.  A  sample  realization  of  EX,  Nj(t2.  is )  and  (r2,  r3)  given  RA 
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Let  us  introduce  an  indicator  variable  /  such  that  /  =  /  if  the  partially  satisfied  requisition 
is  from  source  i ;  /  =  j,  <r.  Then  we  have  the  following: 


(8) 

PriUjHt)  = 

d\RA\ 

EX  -  0, 

N]U2,  r})  = 

n'r 

- 

( d); 

(9) 

PrW/U)  » 

d\RA\ 

EX  =  e 

>  0.  Nj(t 2,  i 

■3)  =  nj;  )V;(/2.  /})  =  /  =  0-1 

<t>)n-rn> 

(d); 

and 

(10) 

PrWjHl)  «  d\RA. 

,  EX  =  ? 

rs 

o 

A 

'3)  =  (v;(/2,  /3)  =  «;•.  /  =  y) 

= 

d 

- 

(k  +  e)  • 

.  (n  —n\  1) ,  . 

4>,  '  '  (d 

-  k). 

k>  0 


The  Pr\Uf(t)  —  d\RA]  can  be  obtained  from  Equations  (8-10)  by  first  computing 
Pr{EX  =  c(>0);  /  ==  /,  Nj(i2,  i3)  =  n':  N^(t2,  i3)  =  n^\RA\,  for  /  =  y,  a.  Let  Yk  denote  the 
number  of  units  demanded  from  the  depot  by  the  Ath  (Ar  ^  1)  requisition  during  (t2,  /3], 

We  consider  the  following  two  situations.  One,  where  not  all  the  requisitions  during 
(t2.  /j)  are  satisfied  by  time  /3,  and  the  other,  where  they  are  all  satisfied. 

(i):  0  ^  nj  +  rv  <  rtj  +  n„  ^  I,  not  all  the  requisitions  in  (t2,  r3]  are  satisfied 

by  time  t}.  We  wish  to  compute 

(11)  Pr{EX  =  e\  I  =  y;  Nj(t2,  t3)  =  nj;  Nj((2.  t3)  -  nj\RA) 

=  +  Y2  -F  ...  +  =  r0(/|)  -  4'(/t,  t2)  -  e ; 

T«j+n<r+i  >  c  ;  out  of  the  first  (nj  +  requisitions  at  the  depot  nj  are  from  source 
j,  and  in]  +  nj  +  l)sl  requisition  is  from  source  j\RA}. 


We  have 

(12)  Pr{Y]+Y2+...+  Yn.+na-ZoUi)-dt(h,  t2) i-e;  K,;+<+1>e|«4| 

=  <(>0  '  17  (Zq(I\)  —  (fo  (t],  ~  c)  •  £  <t>j{p). 

p>e 


/*Hout  of  the  first  (nj  +  n^)  requisitions  at  the  depot,  nj  are  from  base  y,  and  (nj  +  n„  +  l)sl 
requisition  is  from  source  y(/L4)  can  be  derived  using  the  results  on  sequences  of  Poisson 
arrivals  (see  Simon  18),  Shanker  [51);  and  is  given  by 


(13) 


"3  K 

4-  1 

n'i  +  1  1  < 

«,'+<  +  1 

"y  +  ”o-  J 

«;  +  rC  +  1 J 
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Substituting  Equation  (12),  and  Equation  (13)  upon  simplification,  into  Equation  (12)  we  have 

(14)  Pr{EX  -  e\  I  =  j\  Nj(tt.  t3)  =  nj\  N'a(t2.  /j)  -  %\RA) 


(rtj  +  na)  -  (»'  +  n'a)  «y  4-  na 
n]  +  na 


(n‘j+t T) 


(zq(/i)  —  do  (t 


l2)  -  e ) 


I  */</>) 


P><’ 


for  e  =  0,  1 . z0(/i)  -  4  ('t-  '2>  -  (»J  +  0  <  n)  <  «>• 

Similarly, 

(15)  Pr[EX  =  e;  I  =  <r;  2V;(/2.  /3)  =  A/;(/2,  r3)  =  nJ/M} 


(”<,  ~  4) 

(rij  +  na)  -  (n'  +  n£) 


«<r 

4 


rtj  +  n„ 
n'j  +  4 


(z0(/l)  -  4^  (*1. 


/2)  -  ?) 


•  I  0°o>) 

p>e 


for  e  -  0,  1 . z0(fi)  ~  4T('|.  ^  ~  ( nj  +  n<Pi  0  ^  nj,  ^  n„. 

On  summing  Equations  (14)  and  (15),  we  have 

(16)  Pr[EX  =  e;  Sj(t2.  r3)  =  nj-  Wh-  =  n'a\RA\ 


rtj  + 

nj  +  4 


.  (rt'  +  H  ) 

0o  J  ' 


,,T  (z0(ti)  -  4  (ft.  /2>  -  p)  •  L  0<>O») 

p>e 


for  <*  -  0,  1 . z0(r,)  -  </§■  (/,.  t2)  -  (nj  +  «;),  and  0  ^  nj  +  n„  <  n,  +  na 


(ii):  nj  +  n^  -  /»,  +  «CT(>0),  all  the  requisitions  in  (t2.  f3J  are  satisfied  by  time  r3.  In 
this  case  we  have 

(17)  Pr[EX  -  e\  Nj(i2,  r3)  -  nj\  N'a(t2,  l})  -  n^\RA) 

-  Pr\(Yx  +  Y2  +  ...  +  Y„j+„)  <  z0(/,)  -  4'('i-  r2)) 
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u+n, 

£  <t>  o'  (ph 


for  e  -  0,  nj  =  np  nj,  =  n,,. 


«/  +  n„  >  1; 

1,  for  e  =  r0(r,)  -  rff  (r,,  r2), 

4  =  0; 

0,  otherwise. 


From  Equations  (8-10)  and  Equations  (11-17)  we  can  obtain  Pr{(J/(t)  =  For 

computational  convenience,  however,  we  consider  the  following  two  cases:  one  where  all  the 
demands  from  base  j  during  (r2,  f3]  are  satisfied  by  time  r3,  that  is,  d  =  0;  and  the  other  where 
some  demands  from  base  j  during  ( i2 ,  r3]  remain  unsatisfied  by  time  r3;  that  is,  d  >  1. 


(a):  Pr\Uf(t)  «  0|7?/4 ):  From  Equations  (9)  and  (10)  we  conclude  that  given  RA, 
U}(t)  —  0  if  and  only  if  nj  =  rtjC^O).  For  the  case  when  all  the  requisitions  at  the  depot  in 
(r2,  r3]  are  satisfied  by  time  r3;  that  is,  when  nj  +  nj,  =  nt  +  n„,  Pr[V1(t)  -  0|/?/l)  =  1.  Then 
from  Equation  (17),  we  have 


Pr{U}(t)  =  0|/?/4  1  = 


1 

I  *o" °\p) 

P~"j  +  r<a 


for  ny  +  n„  =  0 
for  nj  +  n„  >  1. 


In  other  words, 

(18)  PriUjHl)  =-  0|R/4|  =  PRO  1  (say) 


V'l>-4<'|'2> 


&*'• ’(p )• 


On  the  other  hand,  when  nj  +  n'  <  rij  +  n that  is,  when  not  all  the  requisitions  at  the  depot 
during  (t2,  r3)  are  satisfied  by  time  r3,  then  n,  =  nj  implies  that  nj,  <  n„.  From  Equation  (15), 
after  simplification,  we  have 

(19)  Pr{U/(t)  -  0 1 /?>4 )  —  PRO!  (say) 


I 

*;-o 


n„\(nj  +  4)! 


n^Anj  +  na)'. 


£  <^o  '  °  —  <4 <t)  ~  '  J,  d>o(p) 

e  p>e 


where  the  range  of  summation  for  e  is  from  0  to  r0(f])  -  do  (f|.  h)  -  n,  -  nj,. 


(b):  Pr[Uj2(t)  -  d\RA\,  d  >  1:  From  Equations  (9)  and  (10)  it  is  clear  that  given  RA, 
—  d(^l)  if  nj  <  n2  and  thus  nj  +  na  <  n,  A-  n„  for  nt  >  1.  From  Equations  (8-10) 
and  Equations  (14-16),  it  follows  that 
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(20)  PriU/ii)  =  d\RA )  =  PRD  (say) 


V 1 

I  I 


-  4('u  h)  -  e) 


"j  +  na 

n'j  +  n„ 


^ - - ,J - 77  £  4j(k  +  e)  ■  <t>)nr"r') (d  -  k) 


(rij  +  na)  -  ( n'j  +  na) 


n„  ~  n„ 


,  .  >  f  .  -rr  £  *»</»> 

(nj  +  nj  ~  (n,  +  n„) 

where  the  range  for  summation  of  e  is  from  0  to  z0(fi)  -  4 (fi.  ~  nj  ~  n'a- 


Now  from  Equations  (18),  (19)  and  (20),  we  obtain  Pr{llJ(t)  =  d\D$ (r1(  t2)  =  d fj  (rh 
t2)\  ZQ(t\)  =  z0<ri) }  by  enumerating  over  /Vy(r2,  t2)  and  Na(t2,  i2).  Substituting  the  resulting 
expression,  and  Equation  (7)  into  Equation  (6)  we  obtain  Pr[Uj(t)  =  Sy  +  b)A.  Thus 


(21)  Pr{UjU)  =  Sj  +  b }, 

20('l* 

=  £  \cp[Sj  +  bWfRj  +  Xy°ry.  <bj]  ■ 


£  (PKOl)  •  p[rij  +  nCT|A0/?0] 


+ 


£  £  {PROD  ■  PlnjK^Ro]  ■  P[/7ylx°*0l 

n,-0  na-\ 


sy  +  t> 

+  £  CPU;  +  b  ~  d\\fRj  +  X,°T;,  </>y]  ■ 
d-l 


I 

I 


£  {PRD)  •  />[n(r|A^/?0]  •  />ln,IX;°/?o] 

nv- 0 


CPfrfo^l"  f2)  IXqXto  —  Rq), 


4.2  (B):  Pr {*/,(»)«  5,  +  ft}*. 

To  obtain  the  expression  for  Pr{Uj{t)  =  s,  +  6]fl,  we  note  that  in  case  B  all  demands 
levied  on  the  depot  during  interval  {t2,  /]  remain  unfilled  by  time  t.  In  addition,  some 
demands  from  the  base  /  during  the  interval  (/lt  r2]  may  remain  unfilled  by  time  t}.  Let  for 
this  case 

Uj{t)  -  the  sum  of  units  in  repair  at  base  /  at  time  f  and  the  units  for  which  orders  were 
placed  on  the  depot  by  base  /  during  (r2,  rj;  that  is,  U'(t)  -  Df{i  -  RJ:  t )  + 
Df{t2,  1)  +  Df(t2,  /); 

and 

Uj2{t)  -  the  units  ordered  from  the  depot  by  base  /  during  0|,  r2]  that  are  unfilled  by 
time  t}. 
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Obviously  Uft)  =  Uf(t)  +  Uf(t),  and  U}(t)  and  Uf(t)  are  independent.  Therefore,  the  pro¬ 
bability  distribution  of  Uft)  can  be  obtained  through  the  convolution  of  Uf(t)  and  Uf(t). 
The  probability  distribution  of  Uf(t)  can  be  readily  obtained  while  that  of  Uf(t)  involves  con¬ 
sideration  of  the  sequence  of  arrivals  of  requisitions  at  the  depot  during  (/,,  r2].  Noting  that 
U/(t)  is  independent  of  Z0(/,)  and  Dq  ( f ( ,  t2)  and  that  Pr{Uj'(t)  =  k)  = 
CP[k\\fRj  +  XjHr,  +  r0),  <t>j\  from  Equation  (5)  we  have 

(22)  Pr[Uj(t)  =  s,  +  b)B 


sj+h 

£  CPIs,  +  b  -  d'.kfRj  +  \,°(t,  +  R0),  4>j] 

d- o 


•  Pr{Uf(t)  -  d\Df  (/,,  t2)  =  df(rh  r2);  Z0(/,) 


•  /V{D^(/|,  r2)  —  d2(i |.  r2)S- 

We  shall  obtain  Pr[U2(t)  =  d|Do  (rt,  r2  =  r/o  (t],  r2);  Z0( rj )  =  c0(/])}  by  further  conditioning 
on  t2),  2V,°( Tj ,  t2)  and  D,°(/|,  t2)  for  /  =  j,  <j.  We  temporarily  denote  do  (/,,  r2)  by  df. 

Let 

/?SI  =  t2)  =  n,r;  A fRiii,  r2)  =  w/2.  (for  /  =  ./,  a-);  Z0(rj)  =  z0(/|)}, 


«D2 


X  Df( t , ,  ,2)  =  Do  (/|,  r2)  »  4  ;  I  D/></,.  r2)  = 


and 


=  KD1  t/  Kfl2 


Obviously,  then 


(23) 


»r+.vC)  lftD*nD ) 

Pr{RB2\RB\)  =  <jb0r  ;  '  (dp  ■  <t>f  '  "(df). 


To  obtain  />r|D/2(r)|/?D}  we  need  to  consider  the  number  and  type  (depot  repairable  or  con¬ 
demned)  of  requisitions  that  were  placed  by  the  depot  during  (f|,  r2l  from  each  source  and  are 
completely  satisfied  by  time  r3.  For  /  =  j,  <r,  let 


Nf(t j.  i2)  “  the  number  of  condemnation  type  requisitions  that  were  placed  at  the  depot 
during  U\,  r2l  bv  source  /  and  are  completely  satisfied  by  time  r3; 

Nf(i |,  t2)  —  the  number  of  depot  repairable  type  requisitions  that  were  placed  at  the 
depot  during  (tx,  r2]  by  source  /  and  are  completely  satisfied  by  time  r3 


Now  suppose  we  are  given  RB ,  IV;c(f|,  /2)  -  nf  and  Nf(t\,  t2  -  nf  for  i  —  j,  <r.  Then 
Uf(t)  -  d  if  and  only  if  the  sum  of  the  demands  due  to  unsatisfied  (nf  +  nf  —  nf  -  nf) 
requisitions  and  unsatisfied  units  of  a  possibly  partially  satisfied  requisition,  if  from  source 
(base)  j,  equals  d,  Here  in  case  B  we  note  that  0  <  nf  +  nf  +  nf  +  nf)  < 
(nf  +  nf  +  nf  +  nf).  Further,  we  introduce  an  indicator  variable  /  to  indicate  the  type  and 
source  of  the  partially  satisfied  requisition.  For  /  -  j,  cr  let 
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/ 


iC.  if  the  partially  satisfied  requisition  is  of  condemnation  type 
and  is  from  source  / 

iD.  if  the  partially  satisfied  requisition  is  of  depot  repairable 
type  and  is  from  source 


Also,  let  EX  again  denote  the  number  of  units  supplied  to  the  partially  satisfied  requisition. 


Then 

proceeding 

in  the  manner 

similar  to  that  used  in 

deriving  Equations  (8),  (9)  and  (10),  we 

get 

(24) 

Pr{U,Ht) 

-  d\RB: 

EX 

Ml 

0;  (NfUx.  t j)  -  nf 

;  Nf(ix.  t2)  -  nf. 

/ 

“  J.  cr  ))  -  <t> ,  '  ' 

’"’(d) 

(25) 

Pr[  UfU) 

=*  d\RB, 

EX 

e  >  0;  l\fUx.  r2)  ■ 

•*  nf ;  N'f(tx,  t2)  —  nf. 

(d)  for  8  =  cr  C,  cr  D 

i  -  J.  cr 

),  / 

= 

S|  —  <6,  '  '  '  ' 

and 

(26)  Pr\Uf(t)  -  d\RB,  EX  -  e  >  0;  (Nf'U,.  i2 )  =  nf  N'fU i2 )  -  nf.  i  =  j.  cr)- 
/«8)~  %4>,(k  +  e)<t>,  '  '  '  '(d-k)  for 8  -  jC,  jD 

k>  0 


To  compute  Pr{  Uf{t)  *=  we  now  need  to  obtain  Pr\EX  *  e,  Nf(tx.  t2)  —  n'c,  Nf(tx, 

t2)  —  n'°,  i  —  7,  o);  /  —  8|/?fl).  This  will  be  done  following  the  approach  used  in  deriving 
Equations  (14-17),  and  using  the  results  on  sequences  of  Poisson  arrivals  (see  Shanker  [5]). 
The  probabilitj  of  exactly  nf,  n'f,  nf,  nf  requisitions  out  of  nf,  nf,  nf  nf,  respectively,  being 
satisfied  gr.-n  »hn  a  total  of  (nf+  n'f+  nf+  nf)  out  of  nf  +  nf  +  nf  +  nf)  is  satisfied,  is 
given  by 


(27) 


PS  - 


D  I 

a  I 

/DI 

<7  | 

I  nf  +  nf  +  nf  +  nf 

1  nf+  nf+  +  n«  ' 


Let  {  T1p  y2 . Y  c)  bfe  the  sequence  of  the  number  of  units  demanded  from  the  depot 

ni 

by  the  nf  requisitions  from  source  i.  Similarly,  let  { Yx.  Y2 .  Yno)  be  the  sequence  of  the 

number  of  units  demanded  from  the  depot  by  the  nf  requisitions  from  source  /(<-  j,  cr). 
Then 

(28)  Pr[EX  ~  (N'fUx.  l2)  -  nf,  N'f(tx.  t2)  -  nf.  i  -  j,  o-);  /  -  kC|/?S),  k  -  j.  <r 

-  Pr[(  Yx  +  Y2  +  . . .  +  Ync)  +  ( K,  +  Y2  +  . . .  Ym  .p)  +  (Ti  +  Y2+  ...+  Y^) 


+  (K,  +  Y2+  ...  +  y.o)  -  z0(t,)  +  O',  +  T2+  ...  +  Y n)  +  (T,  +  Y2  + 

n  a  nf 

...  Y  D)  —  e,  Y  .c..  >  e,  out  of  the  first  (nf+  nf+  nf+  nf)  requisitions  at 

n9  n  k  -t  i 

the  depot,  (nf+  nf)  are  from  source  /,  the  next  requisition  is  of  condemna¬ 
tion  type  and  is  from  source  kl(K]+  Y2  +  . . .  +  YhD)  +  (T,+  Y2  + 

...+  Yn)-  dt  (Tt  +  Y2  +  ...+  Yr)  +  (K,  +  Y2+  ...+'  Y  ,  -  4;  RB 1) 
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C  'C 

«*  ~  4 


( nf  +  nf  +  4  +  nf )  -  ( n'f+  nf+  nf+  nf) 


PS 


I<4 

k 


c  '  °  ( ( / . )  +  k ,-e) 


£<t>f(e  +  Ac2)  ■  4o  "  '  "  (4  -  2g(fi)  ~  k ,  -  A;2) 


<4 


l/lA/l  ®)  <nD+rtD-tt  D-r  °) 

P  7  "<*,)■* 0°  7  7  "(<#-*■> 


_  -rn_/  \ n.  f 

4 0°  7  "(4)  <4  7  (4) 


where 


4^  400),  4<  40 1);  k  —  j,  (r 

e  -  0.  1 . (z0(fi)  +  d§  -  ( nf+  nf+  nf  +  >4)), 

and  the  ranges  of  Ac,  and  Ar2  are 

max(4+  4.  4+  nf-  r0(/,)  +  *i  ^  Act  <  4  -  (nf+nf  ~  n'/2-  nf), 

1  <  Ac2  <  min{4~  zo(*i)~  *i-  (nf+nf  -  n'c-  nf).  do  -  e-  (4+4  -  nf-  4)). 

Similarly  the  probability  distribution  when  the  partially  satisfied  requisition  is  depot  repairable 
type,  is  given  by 

(29)  /V(£*  ~  e;  (N;c(f,.  r2)  -  nf,  N'fU,.  t2)  -  4;  /  -  j,  <r);  /  -  it  -  j.  <r 

PS 


„D  _  'D 


(4+4+  4  +  4)-  (4+  4+  nf+nf) 


<n 

(zo(f|)  +  *i~e) 

<  n  P+  ai® -  * ft' "£• -  1) 

J^4f(e  +  k2)4o  (do  —  k\  —  k2—  e) 

k2 

(4-z0(/,)-*|+e)  / 


(j»^+i»P)  (»4+  ) 

40°  ( df)w4o  (do ) 


<4  '  (*.)  *oC  7  7 

where 

4^  400).  4<  401);  Ac  -  j,  <r;  (<4“  4 j ) 

e  -  0,  1 . (z0Oi)  +  do  -  (nf  +  nf+  nf  +  nf)), 

and  the  ranges  of  kt  and  Ac2  are 

max{4+  nf,  nf+  nf-  z0(f|)  +  e)  ^  Aci  <  4  ~  zo('i)  +  *“  <4  +  4  ~  4“  4°). 

1  <  Ac2  ^  4  -  Ac|-  e-  (nf+nf  -  nf-  nf). 


The  probability  distribution  of  UfU)  can  now  be  obtained  from  Equations  (23-29).  Let¬ 
ting  PS1  -  (4  +  nf  +  4  +  4)  —  (nf +  nf +  4  +  nf)  and  after  simplifications  we  have 
from  Equations  (22), 
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s.+b 


(30)  Pr[U,U)  =  s,  +  b)B=  £  I  C/)[s/  +  b  ~  d^k>R'  +  X°(t>  +  Rah 

4>’o«'i>  d~° 


IZII 


nD  „C  nO  „( 

i 


mi 


.  D  „'C  n  D  „  C 

1 a  n<r  nj  n, 


PS 


PS  1 


II  l(n£  -  nf)US(c t.  C) 


dj>  < 


+  («,?  -  n;rD)US((T.  D)  +  (<-  n'c)f/SO,  C)  +  (nf~  n^l/SC/.  0} 
■  PI»;U«'(t0  -  «0)1  ■  />(<IXyD(r0  -  R0)\  •  P[nf;\K^r0  -  R0) ] 

■  P\nD^(r0-  R0)) 


where 


in  k+n  X \ 


US  (a,  C)  =  £{<4  '  "  (z0(m)  +  A:  |  -  e)  •  <*>0D  ’  <*i> 


I  <£i  (e  +  X2) 


X  tp-rikj  ■  tS"""'"  "(4  -  z0(/,)  -kx-k7-  fc3) 

*,-o 


•  -  *3)  •  ^ ,<r  ""  Wo0  -  *■  -  d  +  *3>] 


I 


US  {a.  D)  =  I 


</>o  (  "  (zqOi)  +  Xi  —  e)  •  <i(f 


i.M ) 

D  '  17 


I*>  +  *2> 
^  •> 


(«< 


£  •  *«■”'  "  (4  -  z0(»i)  ~  *1  -  *3  +  e) 


o 


•  ^"’"hd  -  k})  ■  "idR  -  k\-  k2  +  e  -  d  +  k}) 


US(j,  C)  =  £ 


<4  '  '  (z0(/|)  +  *,-*>•  '  (X,) 


£  <£,(e  +  A:2) 


*,-o 


£^*r-iL,,(*J)  •  -  z„(/,)  -  *,  -  *i  -  *,> 
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'  "^(d-k,-k,)  -4>? 


(d§  -  A,  -  d  +  A,  +  k 3) 


and 


US(j,  D)=  £ 


t4>o  (z0(f|)  +  *1  -  e)  ■  <t>f  (A,) 


X  •M*  +  a2) 

*2-o 


\  ,  C<n®  " » 

J  y  (At3)  •  <6* 


(do  Zq('i)  ki 


k  3  +  e) 


(<*  -  A3)  •  <5j 


(<# 


-  A2  +  e 


—  +  A?  +  A3) 


The  ranges  of  for  enumeration  in  US(<t,  C )  and  US(j,  C)  are  as  given  in  Equation  (28), 
and  those  in  USitr,  D)  and  US(J,  D)  are  given  in  Equation  (29).  The  ranges  k2  in 
US  (a,  C)  and  US  (a,  D)  are  also  given  in  Equation  (28)  and  (29),  respectively.  The  ranges 
of  A3  in  the  US(j,  C)  and  US(J,  D)  can  similarly  be  obtained  to  ensure  that  d>,0'(0)  =  1  and 
4>im)(n)  =  0  for  m  <  n.  The  ranges  of  nf,  nf,  rtf and  n'rare  similarly  taken  to  ensure  that  ihe 
combinatorial  terms  in  PS  are  nonnegative. 


Since  the  expressions  for  Pr[Uft)  =  s,  +  b}A  and  Pr[U)(t )  =  s;  +  b\B  given  by  Equa¬ 
tions  (21)  and  (30),  respectively,  are  independent  of  /,  we  obtain  the  stationary  distribution  lim 

/—no 

PriBft)  =  b)  =  B*(b)  by  substituting  these  equations  and  Equation  (1)  in  Equation  (3). 

5.  STATIONARY  DISTRIBUTION  OF  IN-REPAIR  INVENTORY 


As  observed  in  Section  2,  we  note  that  the  stationary  distribution  of  the  number  of  units 
in  repair  at  base  j  is  given  from  the  results  of  a  A/1*'1//?/00  queueing  system.  As  mentioned 
by  Sherbrooke  [7],  the  distribution  is  a  compound  Poisson  with  parameter  kfR,  and  com¬ 
pounding  distribution  <bj( •),  [using  Palm’s  theorem];  that  is, 


(31) 


A  =  0,  1,  ...  ; 
j  =1,2,.../ 


Similarly,  the  stationary  distribution  of  the  number  of  units  at  the  depot  is  given  by. 


6.  STATIONARY  DISTRIBUTION  OF  DEPOT 
BACKORDERS/ON-HAND  INVENTORY 

As  mentioned  earlier,  the  depot  can  be  treated  as  a  single  location  which  receives  recover¬ 
able  and  nonrecoverable  types  of  demand  generated  by  independent  compound  Poisson 
processes  I Dff(t),  1  >  0)  and  \D^ (/),  1  >  0),  respectively.  Let  X0(t)  represent  the  depot 
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inventory  level  at  time  t  which  consists  of  the  units  on-hand  minus  any  backorders.  The  posi¬ 
tive  values  of  X0(t)  indicate  on-hand  inventory  while  the  negative  values  indicate  backorders 
B0(t)  at  time  r,  that  is, 

B0(t)  -  max(0,  -  X0(i)). 

It  is  more  convenient  here  to  obtain  the  stationary  distribution  in  terms  of  X0 (t)  than  for  B0(t ) 
directly.  Let  Ex  -  (S0,  S0  -  1,  ...  1,  0,  -1,  -2,  . . .}  denote  the  state  space  of  X0(r). 

Now,  any  thing  on  order  from  an  external  supplier  at  time  t  —  t0  will  have  arrived  by 
time  /,  thus  the  number  of  units  received  via  procurement  during  (/  -  t0,  t]  is  the  number  of 
units  on  order  at  time  /  -  r0.  The  number  of  units  arriving  from  the  repair  shop  during 
(/  -  r0,  i ]  is  the  number  of  units  repaired  during  this  interval,  which  equals  Q0(t  -  t0 ) 
Dq  (i  =  r0,  t)  -  Q0(t).  Therefore, 

Xq it)  -  T0(/  -  t0)  +  (units  on  order  at  time  t  -  r0)  +  units  repaired  during 

(r  -  r0,  /]  -  (total  demand  during  (t  -  r0,  t]) 

=  X0(t  -  t0)  +  (units  on  order  at  time  t  -  t0)  +  Q0(t  -  r0)  +  D§(t  -  r0,  /) 

-  Qo(D  -  D§(t  -  r0,  t )  -  £>£ (/  -  t0,  t) 

*  Z0(t  -  t 0)  -  Q0(t)  -  DfiU  -  r0,  (). 

Then  for  all  /  €  E0  and  j  6  Ex , 

(33)  Pr{X0U)-j\Z0(0)-i} 

-  X.  Pr{X0(t)=j\Z0(t-r0)~k,  Z0(0)-/}-/V{Z0(/-ro)-*|Z(0)-/) 

*€£0 

-  £/,/-{Zo(/-T0)-O0(r)-Z)§'(r-T0,  i)-j\Z0(t-T0)-k.  Z0(0)  -  /) 

*€£0 

■  Pr[ZQ(t  -  r 0)  -  k  IZq(O)  -  /) 

“  £  Y,Pr[Qo(t)  + D${t- to,  t)- k-j\Q0(t-T0)“  m,Z0(t-T0)- k\ 

A€£q  m—0 

Z0(0)-  /}  •  Pr[Q0(t-T0)  -  m|Z0(t-T0)  -  k\ Z0(0)-  /)j 

•  Pr{Z0(t  —  T0)  —  Ac  |  Z0(0)  —  /). 

It  can  be  seen  that  Qo(t)  and  Z0O)  are  independent  for  any  t  >  0  (see  for  proof  reference 
(5|).  Then  upon  simplification  we  get 

(34)  Pr[X(t)-j\Z0(t  -  t0)  -  k,  Z0( 0)  -  »}  -  Pr[D$ (/  -  r0>  t )  +  Q0U)  -  k-j\Z0(0)  -  t) 

-  Pr\D§ (t  -  t0,  t)  +  Q0(t)  -  k  -  y|Z0(0)  -  /} 

“  *£  PrlQoU)  -  k-j  -  d\Z0m  -  /)  •  Pr\D$(t  -  r0,  t)  -  d). 

d-0 
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Substituting  Equation  (34)  into  Equation  (33)  and  after  taking  the  limit  as  /—•<»,  we  get 
JfjJO')  -  lim  Pr[X0it)  =  ./|Z0(0)  =  /) 

/— oo 

=  £  £  lim  /V{£>0(r)  =  *  -  y  -  <*|Zo(0)  =  /) 

*€f0  l</-0 

■  lim  Pr{Do  ( l  —  r0.  r)  =  d)l  ■  v0(k). 

r— oo  J 

From  Equation  (32)  then  we  have 


(35)  *50)-  £ 

kt£n 


£  CPU  -  7  -  d\k0DR0.  *0°]  ■  CP[d\k£r0,  <t>^\ ■  n0ik) 
a-o 


for  j  =  S0,  S0-  1,  ...  1,  0,  -I,  -2,  ... 


or  in  terms  of  the  backorders 

{k  +  ft 


(36)  B$lb)~  £ 


X  CPik  +  b  -  cIWqRq,  <t>$ ]  ■  CP[d|\0cr0,  <6<fl 


a-o 


nn(k) 


for  b  =  -S0.  -S0  +  1.  ...  0.  1,  ... 


7.  SPECIAL  CASES 

The  results  on  stationary  distributions  for  special  cases  of  complete  recovery  (p,  •  1, 

j  ”  l.  2 . 7)  and  nonrecoverability  (p,  «  0,  ry  -  0;  j  -  1,  2 . J)  can  be  derived  from 

the  expressions  obtained  in  Sections  (4-6).  For  complete  recovery  when  there  are  no  condem¬ 
nations  and  the  system,  as  referred  to  by  Sherbrooke,  is  ‘conservative’,  no  procurement  is 
made  from  the  external  supplier.  The  inventory  position  at  the  depot  remains  at  a  constant 
level  S0(say);  that  is,  Z0(t)  =  S0  for  all  t  ^  0.  Since  £>o  (f)  =  0  for  all  t  >  0,  the  Case  (B) 
discussed  in  Section  4.2  will  not  arise  and  consequently  Equation  (3)  reduces  to 
Pr[Ut(t )  —  Sj  +  6|  -  Pr{Uj(t )  -  s,  +  b) A  with  z0U\)  —  S0.  Upon  substituting  \0C  “  0,  and 
z0(f)  -  S0  in  Equation  (21)  we  can  obtain  the  stationary  distribution  B*{b).  The  stationary  dis¬ 
tributions  of  in-repair  inventory  are  given  by  Equations  (31)  and  (32).  Equations  (35)  and 
(36)  similarly  can  be  appropriately  modified  to  yield  stationary  distribution  of  on-hand  inven¬ 
tory  and  backorders,  respectively. 

For  the  classical  case  of  nonrecoverability,  that  is,  when  the  item  is  consumable,  repair 
loop  is  absent  at  each  location  in  the  system,  and  Qj(t)  =  0  for  t  ^  0  and  j  —  0.  1.  . . .  J.  The 
stationary  distribution  of  (Z0(r),  /  >  0)  can  be  obtained  by  noting  that  inventory  position  at 
the  depot  now  changes  at  all  arrival  epochs  of  demands  at  the  bases.  (£>0(r).  t  >  0)  is  a  com¬ 
pound  Poisson  process  with  parameter  kQ  —  X  *,«  and  compounding  distribution 
1  7  J~' 

<j>n(  )  -  —  •  XM,<  >-  Equation  (1)  can  then  be  modified  to  give  the  stationary  distribution 

in  this  case  by  replacing  rf>n  (•)  with  For  determining  B'(b),  we  note  that  the  cases  (A) 

and  (B)  become  dQ(ty,  r,)  <  ;0(r|)  and  da(t\.  r})  >  rnU|).  respectively.  Equations  (21)  and 
(30)  can  be  modified  to  give  8*(6).  Similarly,  *«(&)  can  be  obtained  from  Equation  (36)  as 
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(37)  t0*0')  =  I 

*€£„ 

a  result  quite  familiar  in  classical  consumable  item  inventory  systems. 

The  results  for  the  case  of  unit  demand,  that  is,  <£,(1)  =  1,  j  =  1,  2 . J  can  be  simi¬ 

larly  derived  by  noting  that  <f>min)  =  1  for  m  =  n  and  <J>min)  =  0  for  m  ^  «,  and  that  £>y ( r ) , 
Dfit),  Dfit ),  DjD(t )  are  identical  to  Nj(t),  Nfit),  Nfit),  N^ir),  respectively  are  are  simple 
Poisson.  Consequently,  £>0(r),  Z)£(r),  (r),  Z)£(r)  are  identical  to  A'o(r),  A$(r),  A# (/), 

Noil),  respectively,  and  are  simple  Poisson.  The  depot  inventory  position  Z0(r )  is  then  uni¬ 
formly  distributed  over  £0.  Equations  (21)  and  (30)  can  be  modified  easily  to  yield  B*ib),  and 
it  can  be  seen  that  the  resulting  expressions  are  the  same  as  obtained  by  Simon  [8).  For  the 
unit  demand  case,  the  cases  of  complete  recovery  and  nonrecoverability  can  be  dealt  in  a 
manner  similar  to  the  one  discussed  above.  For  the  case  of  complete  recovery,  the  results  are 
the  same  as  given  by  Sherbrooke  [7]  for  unit  demand  at  the  bases. 

The  results  for  the  special  cases  described  above  are  obtained  by  suitable  modifications  of 
the  expressions  derived  in  Sections  (4-6).  The  details,  however,  can  be  found  in  references  [5] 
and  [8]. 

8.  CONCLUSIONS 

The  exact  expressions  for  stationary  distributions  of  the  depot  inventory  position,  and  of 
the  number  of  backorders,  the  on-hand  inventory,  and  the  in-repair  inventory  at  each  location 
have  been  derived  under  the  conditions  of  deterministic  repair  and  lead  times.  From  these 
expressions  we  can  determine  long-run  average  to  formulate  objective  function  of  the  total 
expected  cost  and  can  express  the  system  performance  measures  such  as  service  rate,  ready  rate 
etc.  for  the  purpose  of  system  optimization.  The  expressions  obviously  are  computationally 
complex  as  they  involve  calculations  of  several  combinatorial  expressions  and  convolutions. 

The  results  can  also  be  used  to  assess  the  degree  to  which  some  approximate  but  compu¬ 
tationally  simpler  models  can  serve  as  an  approximation  to  our  exact  results.  For  example,  for 
a  unit  demand  case  in  a  two-echelon  conservative  system,  a  study  (reference  16])  on  compari¬ 
son  of  the  exact  results  with  those  given  by  Sherbrooke’s  METRIC  model  which  is  approximate 
but  much  simpler  to  use,  reveals  that  a  considerable  discrepancy  exists  between  the  two  results 
especially  when  the  depot  spare  stock  level  is  low  or  when  a  major  portion  of  the  repairs  is  car¬ 
ried  out  at  the  depot.  For  the  problem  of  optimal  allocation  of  units  of  a  spare  item,  it  has 
been  concluded  in  reference  [6]  that  the  METRIC  model  will  suggest  larger  stocks  at  the  bases 
with  poor  repair  capability,  than  given  by  our  exact  results.  Similar  comparisons  can  be  made 
for  the  problem  of  allocation  of  units  in  a  multi-item  system. 

The  results  of  the  present  analysis  also  apply  to  the  situation  where  the  three  types  of  sys¬ 
tem  demands  (base-repairable,  depot-repairable,  condemnable)  at  the  bases  arrive  indepen¬ 
dently  in  a  Poisson  manner  from  different  sources. 
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ABSTRACT 

The  problem  of  estimating  the  shape-parameter  of  a  distribution  is  con¬ 
sidered.  We  introduce  a  class  of  estimators  the  distributions  of  which  are  in¬ 
dependent  of  location  and  scale.  An  estimator  proposed  by  Weiss  [ll  is  a 
member  of  this  class.  We  find  the  asymptotically  most  efficient  estimator  in 
this  class  which  differs  from  that  proposed  by  Weiss. 


1.  INTRODUCTION 

Let  Xni  <  X„  i  ^  ...  ^  XH„  be  the  order  statistics  from  a  sample  of  size  n  from  a  distri¬ 
bution  function  F  with  density  f(x).  We  assume  that 

0.1)  fix)  —  0  for  x  <  9 

fix)  -  c(x  -  0)y~1  (1  +  rix  -  9 ))  for  x  >  9, 

where  c,  9.  y,  r{y)  are  all  unknown  except  that  we  know  c  >  0,  y  >  0  and  |r(_y)  |  ^  Ky^  for 
all  y  in  some  interval  10.  A],  where  K,  1 3,  A  >  0,  again  unknown.  Suppose  we  want  to  esti¬ 
mate  y  on  the  basis  of  Xn  i . X„  k(n),  where  kin )  -•  »  as  n  -  »  and  kin)/n  —  0.  Weiss 

[1]  suggests  the  following  simple  estimator  for  the  shape-parameter  8  =  y-1 

(1.2)  S  -  log  -  --*"1  ~ ■  /Jog  2, 

-  Xn:l 


which  is  consistent  for  6  but  is  neither  asymptotically  efficient,  nor  is  it  optimal  in  any  ordinary 
sense.  The  purpose  of  this  note  is  to  suggest  a  similar  estimator  which  is  more  efficient  and  is 
optimal  in  some  sense. 

2.  THE  OPTIMAL  ESTIMATOR 

It  is  shown  in  Weiss  (1)  that  under  (1.1),  for  all  asymptotic  probability  calculations,  we 
can  assume  that 

(2.1)  X',-9  +  [iz„  +  ...+  Z,)/8c«}4. 
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where  Z,,  Z2.  ...  are  independent  and  exponentially  distributed  random  variables  with  mean 
1,  »—  1.2 . kin),  lim  kin)  —  °°,  lim  kin)/nt  =  0  for  all  «  >  0.  Let  min)  be  an 

#i— oo  n— oo 

integer,  min)  <  kin)  and  define 


8 


m  {« ) 


log 


Xft  .k  (n ) 


/log 


k  in) 
m  in)  ' 


Using  the  representation  (2.1)  and  the  law  of  large  numbers,  for  every  sequence  min)  for 
which 


(2.2)  lim  min)  *=  °°  and  lim  ikin)  -  min))  -  «> 

n—  oo  rt— °o 

the  estimator  8m<„)  is  consistent  for  8.  Clearly  8  in  (1.2)  is  a  special  case  of  8m(n)  with 
min)  =  [kin)/ 21.  Although  this  choice  of  min)  is  intuitively  appealing,  it  is  not  a  very  good 
one.  Suppose  that 

(2.3)  lim  vrr-  -P  (0  <  p  <  1). 
n  kin) 

Then,  it  is  not  hard  to  show  that  the  asymptotic  variance  of8m(n)  is  given  by 

(2.4)  lim  kin)  Var8m(n)  —  8J(1  —  p)/ip  log2p)  =  82t/(p). 


The  function  Vip)  has  a  J  shape  on  (0,  1)  and  attains  its  minimum  at  p0  =*  .2032.  Hence,  in 
the  sense  of  minimum  variance,  the  optimal  choice  of  min)  will  be  m(n)“ 
[p0M«)l  =  kin)/ 5. 


Finally,  the  results  hold  under  milder  conditions  than  considered  by  Weiss  Ml.  Namely, 
it  is  enough  to  assume  that  Fis  regularly  varying  at  0: 


lim 

>10 


FiO  +  ty) 
Fi9  +  y) 


ty  for  all  t  >  0. 


3.  CONCLUDING  REMARKS 

The  shape-parameter  8  -  \/y  is  a  scale  parameter  for  8m(n),  whose  distribution  does  not 
depend  on  0  or  c.  Thus,  81(,o/(<n)]  is  optimal  (among  all  8(M((,)|,  0  <  p  <  1)  uniformly  in  9,  c 
and  8.  The  optimality  is  preserved  if  we  use  1/8 (n)|  to  estimate  y.  The  asymptotic  relative 
efficiency  (ARE)  of  8(J*((,)|  with  respect  to  8^o*(„(|  is  at  least  90%  in  the  interval  p  €  1.1..35] 
and  only  74.2%  for  p  —  .5  (see  Table  1). 
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TABLE  1  —  The  asymptotic  relative 
efficiency  ofi[pkU)\  with  respect 
to  6iPo*un  On  percents) 


p 

ARE 

P 

ARE 

0 

0 

.20 

loo.o 

.01 

33.1 

.30 

95.9 

.02 

48.2 

.40 

86.4 

.03 

58.7 

.50 

74.2 

.04 

66.7 

.60 

60.4 

.05 

72.9 

.70 

45.8 

.06 

78.0 

.80 

30.8 

.07 

82.2 

.90 

15.4 

.08 

85.7 

.95 

7.7 

.09 

88.5 

.99 

1.5 

.10 

91.5 

1.00 

0 
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ABSTRACT 

Numerous  procedures  have  been  suggested  for  solving  fixed  charge  prob¬ 
lems.  Among  these  are  branch-and-bound  methods,  cutting  plane  methods, 
and  vertex  ranking  methods,  fn  ait  of  these  previous  approaches,  the  pro¬ 
cedure  depends  heavily  on  the  continuous  costs  to  terminate  the  search  for  the 
optimal  solution.  In  this  paper,  we  present  a  new  branch-and-bound  algorithm 
that  calculates  bounds  separately  on  the  sum  of  fixed  costs  and  on  the  continu¬ 
ous  objective  value.  Computational  experience  is  shown  for  various  standard 
test  problems  as  well  as  for  randomly  generated  problems.  These  test  results 
are  compared  to  previous  procedures  as  well  as  to  a  mixed  integer  code.  These 
comparisons  appear  promising. 


1.  INTRODUCTION 

One  class  of  mathematical  programming  problems  that  has  been  of  continuing  interest  to 
researchers  is  the  linear  fixed  charge  problem  (LFCP).  The  LFCP  is  similar  in  structure  to  a 
minimization  linear  programming  problem  except  that  a  fixed  or  lump  charge  must  be  paid  if 
the  associated  continuous  variable  is  positive.  It  is  this  discontinuity  that  has  made  the  LFCP 
difficult  to  solve. 

The  LFCP  may  be  formulated  as  follows: 

(P)  maximize  CTx  +  FTy 

subject  to  x  €  S 

(l  if*,  >  0 
and  yj  -  |0  if  Xj  „  0 

where  5  -  {x\Ax  -  b,  x  >  0). 

If  n'  is  the  number  of  structural  variables  and  /  is  the  number  of  slack  and/or  surplus  variables, 
then  «  —  «'+/  is  the  number  of  *  variables.  Then  A  is  m  x  n\  C  is  n  x  1;  F  is  n  x  1  and 
nonnegative.  Both  C  and  F  have  the  last  /  components  equal  to  zero.  Finally,  b  is  a  nonnega¬ 
tive  m  x  1  vector.  The  c/s  are  the  continuous  or  "unit"  costs  and  the  //s  are  the  fixed  costs. 
Also,  let  N  -  (1,  . . .  ,  «)  and  M  -  {1,  . . .  ,  m}. 
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Previous  work  on  this  problem  has  centered  about  either  the  LFCP  with  a  general  con¬ 
straint  matrix  or  the  fixed  charge  transportation  problem  (FCTP)  (e.g.  [5]).  The  latter  has  the 
special  form  of  the  Hitchcock  transportation  problem.  In  all  cases,  work  has  sought  to  exploit 
the  result  first  shown  by  Hirsch  and  Dantzig  [4]  that  the  optimal  solution  to  any  fixed  charge 
problem  (if  it  exists)  will  occur  at  an  extreme  point  of  the  constraint  set.  It  is  not  as  easy  as  it 
may  seem  to  find  a  global  minimum  since  it  has  also  been  recognized  that  many  basic  solutions 
are  local  minima  [7], 

Work  on  the  general  version  LFCP  has  taken  three  different  directions.  Steinberg  [11] 
used  a  search  procedure  to  solve  problems  with  as  many  as  30  variables  and  15  constraints 
while  Taha  [13]  used  an  adjacent  vertex  cutting  plane  procedure  to  solve  problems  of  size 
15  x  20.  Finally,  McKeown  [8]  used  a  vertex  ranking  approach  first  suggested  by  Murty  [9]  to 
solve  both  general  and  transportation  fixed  charge  problems.  He  solved  general  problems  with 
as  many  as  20  structural  variables. 

Our  approach  to  the  problem  will  be  to  use  a  branch-and-bound  procedure  to  solve  the 
LFCP.  We  will  discuss  this  algorithm  in  Section  2  and  present  a  numerical  example  in  Section 
3.  Finally,  we  present  and  discuss  computational  results  in  the  last  section. 

2.  THE  BRANCH-AND-BOUND  ALGORITHM 

Our  branch-and-bound  algorithm  is  a  straightforward  application  of  the  procedure 
described  by  Geoffrion  and  Marsten  [3].  They  suggest  that  difficult  problems  such  as  (P)  can 
be  solved  by  a  systematic  process  made  up  of  separation,  relaxation,  and  fathoming.  The 
separation  generates  candidate  problems  (CPs)  which  are  added  to  a  candidate  list  (CL).  A 
(CP)  is  removed  from  the  (CL)  and  an  attempt  is  made  to  find  a  bound  on  its  optimal  solution. 
Usually,  a  relaxation  of  the  (CP)  is  solved  to  calculate  this  bound.  If  the  bound  is  worse  than 
the  value  of  a  feasible  solution  or  if  the  (CP)  is  infeasible,  it  is  possible  to  eliminate  the  (CP). 
If  it  cannot  be  eliminated,  the  (CP)  is  then  separated  also.  If  the  (CP)  is  discarded,  it  is  said  to 
be  fathomed.  This  procedure  continues  until  the  (CL)  is  empty.  At  any  point  in  the  procedure, 
the  best  feasible  solution  found  to  date  is  known  as  the  incumbent. 

The  unique  part  of  our  branch-and-bound  algorithm  is  based  upon  the  following  proposi¬ 
tions: 


PROPOSITION  1:  Consider  Problem  (Pc)  below  (the  continuous  portion  of  (P)): 

(Pf)  minimize  CTx 

subject  to  x  6  5 

Then,  if  (x*,  y*)  solves  problem  (P)  and  x  solves  problem  ( Pc ),  then  CTx  <  CTx*. 
PROOF:  Trivial  from  the  theory  of  linear  programming. 

PROPOSITION  2:  Consider  problem  (Pg)  below 
(P8)  minimize  £  fyj 

JCN 
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subject  to  £  S IJyJ  >  I  ,  i  €  M, 
.V,  >  0,  j  6  N 


where 


8y  = 


1  if  a„  >  0 


0  if  =$0  !n  ^  and  1  ' 


1  if  b,  >  0 
0  otherwise 


/  €  M. 


Then  if  ( x *,  y *)  solves  problem  (P)  and  >■  solves  problem  Ps.  then  Fry  ^  Frv  *. 


PROOF:  See  [8].  These  two  propositions  give  rise  to  the  following  result  which  defines 
the  relaxation  of  problem  (/>): 


PROPOSITION  3:  If  v (/»,.)  =  <Trx.  v(/»8>  =  Frv,  and  v(/»)  =  C^x*  +  fV,  then 
v(P,)  4-  v(Ps)  < vf/3). 


PROOF:  Follows  from  previous  results. 

Separation  occurs  by  selecting  a  variable  x,  which  is  basic  and  nondegenerate  in  (Pt.)  and 
which  has  not  been  previously  used  for  separation,  and  alternately  setting  the  corresponding  y, 
value  to  0  and  1.  This  avoids  problems  with  infeasibility  since  we  know  that  any  solution  to 
( Pc )  is  also  a  solution  to  ( P ).  We  also  avoid  degeneracy  considerations  by  this  selection  of 
variables. 


ft  is  not  necessary  to  solve  (Pt.)  and  (Pf)  from  scratch  at  each  new  separation.  This  may 
be  avoided  by  using  the  optimal  tableau  for  each  of  these  problems  corresponding  to  the  (CP) 
being  separated.  If  y}  is  set  to  be  0,  this  implies  that  x,  equals  0  also.  The  new  tableaus  for 
this  branch  for  (Pc)  and  (Pg)  can  be  found  by  using  the  dual  simplex  algorithm  to  drive  y,  and 
Xy,  respectively,  out  of  the  optimal  basis  for  each  problem.  If.  in  either  case,  no  feasible  solu¬ 
tion  exists  with  x,  =  0  and  y;  =  0,  then  the  branch  is  terminated.  Otherwise  v{Pc)  and  v( Pb) 
are  equal  to  the  optimal  solution  values  of  the  respective  problems.  Obviously,  if  y,  is  not  basic 
in  (Ph)  for  the  (CP),  then  v(PB)  does  not  change. 


On  the  other  hand,  if  y,  is  set  to  1  then  x,  is  already  basic  and  positive  by  our  selection  of 
separation  variables  so  no  action  is  necessary  on  (f,.).  Hence,  the  optimal  tableau  for  the  (CP) 
is  also  optimal  for  the  new  problem  formed  by  the  separation.  In  the  case  of  (/V.  we  can  com¬ 
pute  the  optimal  solution  for  this  branch  by  noting  that  (PB)  is  in  actuality  a  relaxation  of  a  set 
covering  problem.  We  can  then  determine  the  effect  of  setting  y,  =  1  by  seeing  that  the  new 
set  to  be  covered  is  equal  to  1  -  8,,  for  i  €  M.  We  are  then  interested  in  solving  the  following 
modified  version  of  (PB)  which  we  will  term  problem  (P lfi): 


(/*lg)  minimize 
subject  to 


I  fM 

>U  2 

£  ^  r«  “  £  «./  '  e 
y)  >0  j  €  y2. 
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where  J0  =  I  j  \y2  =  0) 

J\  =  (y  \y,  =  1} 

and  J2  =  N~[JiUJ0 ). 


It  is  not  necessary  to  solve_(Fl8)  anew  at  each  separation  of  a  (CP).  If,  for  the  (CP),  Bh 
is  the  optimal  basis  for  (Pl8),  T  is  the  right-hand-side,  and  Ay  is  the  yth  column  of  the  con- 
straint_  matrix,  _  i.e.  |6y,  /  €  A/),  for  the  separation  variable,  then 
Bj-1  (f  -  Ay)  =  5s'1  T  —  Z?8~'Ay  is  the  new  right-hand-side  for  (Pl8)  for  the  y,  =  1  branch. 
This  allows  us  to  compute  a  new  right-hand-side  for  (/* ls)  at  this  branch  by  subtracting  the  yth 
transformed  column  from  the  right-hand-side  corresponding  to  the  (CP)  being  separated.  If 
the  new  right-hand-side  is  feasible  (^0),  then  the  present  basis  ( Bs )  is  optimal.  If  this  is  not 
true,  then  the  dual  simplex  method  can  be  used  to  attempt  to  achieve  a  feasible  solution.  If  no 
basic  feasible  solution  exists  for  the  new  right-hand-side,  this  branch  can  be  terminated. 

If  Z8  is  the  optimal  solution  value  to  (Pl8)  as  found  above,  then  v(P5)  =  Z8  +  V  /; . 

Also,  if  Z  is  the  value  of  an  incumbent  solution  and  for  any  branch,  v(P8)  +  v(P, )  ^  Z,  the 
branch  can  be  terminated. 

This  process  begins  by  solving  the  original  versions  of  (P8)  and  (Pc)  to  determine  initial 
optimal  tableaus.  These  tableaus  are  then  used  to  generate  successive  tableaus  as  described  ear¬ 
lier.  Rather  than  storing  the  entire  tableau  for  each  (CP)  in  the  (CL),  the  index  sets  of  the 
basic  variables  for  (Pc)  and  (P8),  which  we  denote  as  Bc  and  Z?8,  are  stored  as  binary  words. 
When  the  (CP)  is  removed,  a  "crashing"  routine  is  used  to  compute  the  desired  tableaus  for 
(Pc)  and  (Ps)  corresponding  to  the  index  sets.  By  crashing,  we  mean  forcing  the  variables  in 
the  index  set  into  the  current  basis  without  regard  to  intermediate  feasibility.  The  fact  that  the 
current  and  desired  tableaus  are  both  feasible  insures  that  the  desired  tableau  will  be  achieved. 

A  Linear  Programming  Lower  Bound 

Problem  (P)  may  also  be  formulated  as  a  mixed  integer  programming  problem  as  follows: 

(Put)  minimize  Z  =  CTx  +  FTy 

subject  to  x  €  S 

ujFj  ~  Xj  >  0  j  €  N 

yj  €  {0,  1)  j  €  N 


where  uy  >  xy  for  j  €  N. 

For  fixed  charge  transportation  problems  with  supplies  {5,}  and  demands  (£>,),  then 
/j.tj  —  min  (S,,  Dj).  Or,  if  the  A  matrix  is  nonnegative ,  the  uy  values  can  be  found  by 
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However,  if  the  A  matrix  does  not  satisfy  either_of  the  above  criteria,  then  we  must  compute 
the  Uj  values  by  other  means.  To  do  this  Jet  Z  be  the  the  value  of  an  incumbent  solution. 
Then  we  can  require  that  v(Pc )  +  v(Ps )  <  Z.  We  can  use  this  result  to  compute  upper  bounds 
on  the  Xj  using  the  following  linear  programming  problem: 

(Pu)  maximize  Xj 

subject  to  x  €  S 

CTx  <  Z  -  v(/>8). 

For  each  xJy  Uj  -  v(Pu). 

If  we  relax  the  integer  restriction  on  the  y}  variables  to  0  <  ^  1  in  PMh  we  have  a 

linear  programming  relaxation  which  yields  an  optimal  solution  value  Z,.  Obviously,  Z,  <  Z, 
so  we  could  use  this  as  a  lower  bound  in  a  branch-and-bound  procedure.  This  bounding  pro¬ 
cedure  will  be  compared  in  a  later  section  to  v(Pc)  +  v(Ps). 

3.  EXAMPLE 

As  an  illustration  of  our  branch-and-bound  solution  procedure  for  the  linear  fixed  charge 
problem,  consider  the  following  problem: 

minimum  4x,  +  2x2  +  3x3  +  2  4y{  +  12  y2  +  16.y3 

subject  to  X)  +  3x2  >15 

X|  +  2x3  >  10 

2x|  +  x2  >20 

1  if  Xj  >0  J  =  1,  2,  3 
yi  “  0ifx,  =  0 
Xj  >  0,  j  =  1 . 3. 

For  this  problem  Ph  is  as  follows: 

minimum  24>>|  4-  12.p2  +  16.y3 

subject  to  J'i+J':  >1 

yx  +^3  >1 

y,  +  y2  '  >1 

yj>  0,  y=l,2,  3. 

Solving  Pc  and  Pt  we  get  basic  variable  index  sets  Bc  =  |1,  2,  3}  and  fi8  =  (l), 
\(PC)  =  Z,.  =  41.5,  vfPj)  =  Z8  =  24.  We  can  also  compute  a  feasible  solution  to  P  using  Bc  to 
obtain  a  Z*  =  93.5.  We  now  branch  on  this  solution  to  Pc  by  setting  =  0  and  X|  =  0.  This 
yields  a  Zr  =  55  and  Z8  =  28  for  a  lower  bound  of  83  which  in  this  case  also  equals  a  new  Z* 
value  at  this  solution  (node  2  in  diagram). 

We  now  set  yj  =  1  and  compute  Zc  -  41.5,  Z8  =  0,  ZLB  =  41.5  +  0  +  24  =  65.5, 
B,  ~  { 1,  2,  3),  fl8  =  {(?},  J0  =  9,  7)  =  {1),  and  J2  -  {2,  3).  We  now  branch  on  this  last  node 
by  setting  y2  -  0  and  y2  =  1 .  The  zero  node  is  terminated  since  Zc  =  60,  Z8  =  0  and 
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ZLB  “  60  +  24  =  84  ^83.  The  one  node  yields  a  Zc  =  41.5,  Z8  =  0,  and 
ZLB  =  41.5  +  24  +  12  =  77.5.  For  this  last  node  (5  on  the  diagram),  Bc  =*  [  1,2,3}, 
Bs  =  {4,  6},  J0  =  0,  7,  =  { 1,  2),  and  i2  =  (3). 

If  we  then  branch  on  node  5  by  setting  =  1,  we  determine  a  ZLB  =  93.5  which  is 
greater  than  83  so  we  terminate  this  branch.  However,  branching  on  +3  =  0  yields  a  new 
Z*  =  79  1/3.  2,  =  43  1/3,  Zs  =  0  and  ZLB  =  43  1/3  +  24  +  12  =  79  1/3.  Since  the  lower 
bound  equals  Z*,  we  may  also  terminate  this  node  with  the  present  incumbent  solution,  i.e., 
Z*  =  79  1/3  and  Bc  =  { 1 .  2,  6),  being  optimal. 

At  the  root  node,  the  optimal  solution  to  the  LP  relaxation  of  (PM/)  is  found  by  comput¬ 
ing  Uj  =  15,  20,  5  for  j  =  1,  2,  3.  The  value  of  Z,  using  these  upper  bounds  for  this  problem 
is  58.7  as  compared  to  the  lower  bound  computed  above  of  65.5  =  v(Pf)  +  v(F8). 


4.  COMPUTATIONAL  RESULTS 

The  branch-and-bound  algorithm  discussed  previously  was  programmed  in  FORTRAN  for 
testing  on  a  CDC  CYBER  70/74  computer.  In  coding  the  algorithm,  standard  FORTRAN  was 
used  with  the  exception  of  bit  manipulation  extensions,  i.e.,  storage  of  all  pertinent  variable 
lists  was  done  using  bit  manipulation  of  the  CYBER  60  bit  words. 
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To  determine  a  good  starting  candidate  for  optimality.  Phase  I  of  Walker’s  (15)  heuristic 
was  used  after  the  solution  of  the  continuous  portion  of  the  linear  fixed  charge  problem.  This 
has  been  shown  to  give  very  good  approximate  results.  The  procedure  for  selecting  a  node  to 
remove  from  the  partial  solution  list  to  branch  on  was  very  simple  in  that  we  always  chose  the 
last  node  on  the  list.  Similarly,  we  used  a  simple  approach  to  select  a  variable  to  use  in  branch¬ 
ing.  The  branching  procedure  always  chose  the  first  free  variable  in  the  basis  for  the  continu¬ 
ous  problem  to  branch  upon  by  first  setting  the  corresponding  integer  variable  to  zero  and  then 
setting  it  to  one.  From  this  discussion,  it  is  clear  we  have  not  attempted  to  "optimize"  our  algo¬ 
rithm. 

To  test  this  code,  various  types  of  test  problems  were  solved.  The  first  class  of  problems 
were  general  fixed  charge  problems  with  all  equality  constraints.  These  problems  were  origi¬ 
nally  generated  by  Cooper  and  Drebes  ( 1  ]  and  have  been  used  by  various  authors  as  benchmark 
problems.  These  test  problems  are  of  size  5x10  with  the  following  characteristics. 

(i)  Ia„|  <  20 

(ii)  I  <  6,  <  999 

(iii)  1  <  Cj  ^  20 

(iv)  1  <  /,  999 

and  are  50%  dense.  To  test  larger  problems,  these  5x10  problems  were  combined  by  placing 
them  on  the  diagonal  to  generate  10  x  20  and  15  x  30  problems.  The  results  of  solving  these 
problems  with  our  code  are  shown  in  Table  1 . 


TABLE  1  —  Computational  Results  With 
Standard  Test  Problems 


Algorithm 

Machine 

m  x  n 

Number  of 
Problems 

Average 

Solution 

Time 

Branch-and-Bound 

CYBER  70/74 

5  x  10 

12 

.114 

10  x  20 

6 

1.866 

15  x  30 

4 

15.189 

Steinberg's  Search 

IBM  7072 

5  x  10 

15 

9.600 

15  x  30 

10 

1266.0 

Vertex  Ranking 

UNIVAC  1108 

5  x  10 

9 

.496 

10  x  20 

5 

37.704 

15  x  30 

1 

* 

Walker 

CDC  1604 

5  x  10 

52  (52) 

4. 

15  x  30 

5  (5) 

18. 

'Storage  overflow 
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As  noted  above,  these  problems  have  been  used  as  test  problems  by  various  previous 
researchers.  For  that  reason,  we  have  attempted  to  supply  comparative  computational  results. 
Previous  results  shown  in  Table  1  are  those  of  Steinberg’s  testing  of  his  search  procedure  on 
the  IBM  7072  and  McKeown’s  testing  of  the  vertex  ranking  procedure  on  the  UNI  VAC  1108. 
In  each  case  we  have  shown  the  CPU  seconds  to  find  and  prove  an  optimal  solution.  In  the 
case  of  our  branch-and-bound  code,  these  times  are  based  on  using  the  FTN  compiler  with 
OPT  —  2  on  a  CYBER  70/74.  We  have  also  shown  results  published  by  Walker  for  his  com¬ 
plete  heuristic  procedure.  For  that  algorithm,  the  value  in  parentheses  refers  to  the  number  of 
problems  tested  that  were  optimal. 

In  addition  to  the  benchmark  problems  we  have  compared  our  branch-and-bound  code  for 
fixed  charge  problems  (FCBB)  to  a  state-of-the-art  mixed-integer  code  based  on  Beale  and 
Tomlin  [14]  work  as  implemented  by  Sinha  [12]  which  we  will  denote  as  SMIP.  To  use  SMIP, 
upper  bounds  on  each  variable  must  be  found.  These  were  found  by  solving  Pu  for  each  vari¬ 
able.  This  was  done  in  FCBB  using  the  heuristically  derived  value  of  Z*  and  the  linear  pro¬ 
gramming  solution  value  of  P8,  i.e.,  v(Ps),  at  the  root  node.  Since  the  constraint  set  remains 
the  same  for  each  variable,  it  was  not  necessary  to  solve  each  version  of  Pu  from  scratch. 
Instead,  the  optimal  basis  for  the  first  variable  problem  was  used  as  a  starting  point  for  the 
second  variable  problem,  and  so  on  until  all  upper  bounds  were  computed. 

The  problems  solved  were  the  same  as  the  Cooper-Drebes  problems  with  the  exception 
that  problems  of  various  sizes  were  generated.  The  results  of  this  testing  are  shown  in  Table  2. 
For  each  set  of  five  problems,  we  have  shown  the  average  time  and  range  of  limes  (CPU 
seconds)  for  FCBB  and  SMIP.  We  have  also  shown  these  same  values  for  the  FCBB  without 
the  Walker  heuristic  and  for  SMIP  without  including  the  times  necessary  to  calculate  the  upper 
bounds.  The  former  calculations  were  made  to  determine  the  effect  on  FCBB  of  not  using  a 
heuristic  since  none  was  used  in  SMIP.  Similarly,  the  latter  calculations  were  made  to  compare 
FCBB  and  SMIP  under  the  assumption  that  upper  bounds  were  known  for  each  variable  and  did 
not  need  to  be  calculated. 

Looking  at  Table  2,  we  note  that  for  Problem  Sets  1-5,  FCBB  is  significantly  faster  than 
SMIP  for  5  x  10  problems  and  this  disparity  increases  as  the  number  of  variables  increases 
until  FCBB  is  faster  by  more  than  a  factor  of  20  for  5  x  50  problems.  The  same  general  trend 
exists  when  FCBB  is  compared  to  SMIP  without  the  upper  bound  calculations  but  as  would  be 
expected  the  SMIP  times  are  slightly  faster.  For  problem  sets  6-9,  the  relative  efficiency  of 
FCBB  compared  to  SMIP  is  not  as  dramatic  but  FCBB  remains  at  least  twice  as  fast  as  SMIP. 
Also,  SMIP  exceeded  storage  limitations  for  the  largest  problems. 

Two  other  results  may  also  be  noted  from  Table  2.  First,  for  problem  sets  1-5,  increasing 
the  number  of  variables  has  some  effect  on  solution  times  for  FCBB  but  not  as  much  as  would 
be  expected.  However,  there  is  a  much  more  dramatic  effect  of  increasing  the  number  of  vari¬ 
ables  for  problem  sets  6-9.  The  second  result  is  that  removing  the  Walker  Heuristic  has  very 
little  effect  on  the  solution  times  except  for  the  5  x  30  problems.  This  indicates  that  the  algo¬ 
rithm  is  finding  a  good  incumbent  value  early  in  the  branching  process. 

One  of  the  reasons  for  the  relative  efficiency  of  FCBB  to  SMIP  is  that  while  the  LP  relax¬ 
ation  has  m  +  n  constraints,  Since  both  procedures  use  pivoting  to  calculate  bounds,  the  larger 
number  of  constraints  slows  the  LP  relaxation. 
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TABLE  2  —  Comparison  of  Algorithms  (all  >  constraints) 


Problem  Set 

Size 

FCBB 

Time 

SM1P 

Time 

FCBB  (without 
heuristic) 
Time 

SMIP  (minus 
Bound 

Calculation  Time) 

Ave. 

Range 

Ave. 

Range 

Ave. 

Range 

Ave. 

Range 

1 

5  x  10 

.070 

.023- 

.202 

.130 

.068 

.022- 

.136 

.086- 

.219 

.389 

.531 

2.482 

2 

5  x  20 

.151 

.043- 

1.018 

.444- 

.158 

.045- 

.815 

.296- 

• 

.500 

2.805 

.531 

2.482 

3 

5  x  30 

.222 

.087 

1.896 

.785- 

.320 

.103- 

1.523 

.613- 

.610 

3.255 

1.048 

2.813 

4 

5  x  40 

.349 

.136- 

5.028 

2.593- 

.394 

.239- 

4.375 

2.036- 

.549 

8.678 

.582 

8.038 

5 

5  x  50 

.323 

.172- 

7.055 

4.492- 

.362 

.166- 

6.088 

3.656- 

.668 

13.383 

.661 

12.388 

6 

10  x  20 

.424 

.365- 

1.993 

1.082- 

.487 

.210- 

1.295 

.541- 

.626 

3.087 

.817 

2.135 

7 

10  x  30 

2.554 

1.086- 

6.416 

3.499- 

_ 

_ 

4.697 

2.219- 

4.373 

10.818 

9.041 

8 

10  x  40 

7.273 

2.111- 

15.727 

10.079 

_ 

_ 

12.767 

7.383- 

20.001 

25.011 

21.632 

9 

10  x  45 

11.822 

1.793- 

+ 

+ 

_ 

_ 

+ 

+ 

33.986 

+  Exceeded  storage  limitations. 


The  final  result  of  interest  in  Table  2  is  the  wide  range  of  times  for  each  problem  set. 
This  is  similar  to  results  reported  by  Ross  and  Soland  [10]  for  Special  Transportation  Problems 
and  by  Fisk  and  McKeown  [2]  for  Pure  Fixed  Charge  Transportation  Problems.  It  is  unclear  in 
any  of  these  contexts  as  to  what  makes  a  problem  "easy"  or  "difficult"  to  solve. 

Table  3  gives  results  of  using  FCBB  to  solve  six  versions  of  problem  set  3.  Three  charac¬ 
teristics  were  varied  to  determine  their  effect  on  solution  times.  These  characteristics  were 
density,  constraint  type,  and  relative  size  of  fixed  and  continuous  costs.  With  the  exception  of 
these  three  characteristics,  the  problems  in  Table  3  are  the  same  as  Problem  Set  3  in  Table  2, 
i.e.,  five  problems  in  each  set  with  each  problem  being  5  x  30  and  being  randomly  generated 
using  the  same  parameters  as  the  Cooper-Dreber  problems.  In  each  case,  the  original  results 
for  Problem  Set  3  are  used  as  a  bench  mark.  To  summarize,  Table  3  shows  that  problems  with 
equality  constraints  are  more  difficult  to  solve  than  problems  with  greater-than-or-equality  con¬ 
straints  but  problems  that  are  90%  dense  are  easier  to  solve  than  problems  that  are  50%  dense. 
Finally,  the  greater  the  continuous  costs  relative  to  the  fixed  costs,  the  easier  the  problems  are 
to  solve. 

The  first  two  results  have  not  been  reported  before  while  the  last  is  similar  to  that  shown 
by  Kennington  [6]  for  Fixed  Charge  Transportation  Problems.  The  first  result  is  thought  to  be 
caused  by  the  low  value  of  P 9  as  compared  to  the  sum  of  fixed  charges  in  the  optimal  solution 
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TABLE  3  —  Effect  of  Parameter  Variation 


Problem  Set 

Size 

Density 

C, 

Range 

F, 

Range 

Constraint 

Type 

FCBB* 
Ave.  Time 

Time* 

Range 

10 

T 

X 

30 

50% 

1-20 

0-999 

GT 

100 

39.18-274.77 

11 

5 

X 

30 

50% 

1-20 

0-999 

EQ 

402.25 

100.00-819.36 

12 

5 

X 

30 

90% 

1-20 

0-999 

GT 

78.82 

64.86-91.89 

13 

5 

X 

30 

50% 

0-0 

0-999 

GT 

276.57 

80.63-561.26 

14 

5 

X 

30 

50% 

1-99 

0-999 

GT 

79.81 

38.28-102.70 

15 

5 

X 

30 

50% 

1-20 

0-20 

GT 

38.28 

32.88-47.74 

"All  times  as  a  percent  of  Average  Time  for  Problem  Set  10  (Same  as  Set  3). 

to  P.  On  the  other  hand,  higher  density  values  will  lead  to  higher  values  of  Pt,  relative  to  the 
sum  of  fixed  charges  in  the  optimal  solution.  Finally,  higher  continuous  costs  lead  to  a  better 
overall  bound  since  v(Pf)  tends  to  dominate  the  overall  cost  structure  and  the  solution  to  Pc 
comes  closer  to  matching  the  optimal  solution  to  P.  However,  FCBB  was  able  to  solve  prob¬ 
lems  with  the  continuous  costs  equal  to  zero  in  a  reasonable  length  of  time.  This  is  the  first 
time  problems  of  this  type  have  been  attempted  and  successfully  solved.  For  problems  of  this 
type,  the  Pt  bound  alone  appears  to  be  adequate. 

In  Table  4,  we  have  compared  the  lower  bounds  found  by  FCBB  and  by  using  the  optimal 
solution  to  the  linear  programming  relaxation  (/*«/).  We  used  six  problems  previously  solved 
as  portions  of  Problem  Sets  3  and  8.  For  each  problem,  we  have  shown  the  time  to  compute 
the  bound,  the  lower  bounds  as  a  percentage  of  the  optimal  solution  (V(P))  and  the  solution 
time.  The  results  from  these  six  problems  do  not  show  any  clear  trends.  The  lower  bounds 
tend  to  favor  the  FCBB  algorithm  as  do  the  solution  times  but  the  lower  bound  for  Problem  3c 
is  better  using  the  LP  relaxation.  The  only  general  result  that  can  be  derived  from  the  table  is 
that  the  quality  of  the  bound  does  not  in  itself  guarantee  a  faster  solution  time  for  a  given  prob¬ 
lem. 


TABLE  4  —  Comparison  of  Bounds 


Problem 

Size 

FCBB* 

Bound 

Time 

FCBB" 

Bound 

FCBBC 

Solution 

Time 

LP* 

Bound 

Time 

LPb 

Bound 

LP1 

Solution 

Time 

3a 

5  x  30 

38.59 

77.98 

5.64 

39.86 

72.65 

100 

3b 

5  x  30 

63.95 

85.31 

11.05 

79.30 

71.99 

100 

3c 

5  x  30 

58.09 

71.01 

11.69 

81.41 

77.09 

100 

8a 

10  x  30 

12.82 

54.29 

14.28 

23.64 

29.26 

100 

8b 

10  x  30 

9.92 

60.85 

40.95 

49.95 

53.84 

100 

8c 

10  x  30 

5.25 

35.47 

52.81 

41.59 

45.21 

100 

•  Alt  limes  as  a  percent  of  time  to  find  the  optimal  solution  for  that  algorithm.  For  the  LP  bound, 
the  times  include  the  time  to  compute  upper  bounds  on  the  variables. 

bAII  bounds  as  a  percent  of  optimal  value  (v(P)). 

'All  solution  times  as  a  percen'  of  time  using  LP  relaxation  (SMIP)  to  find  optimal  solution 


NAVAL  RESEARCH  LOGISTICS  QUARTERLY 


VOL.  28,  NO  4,  DECEMBER  1981 


BRANCH-AND-BOUND  FOR  FIXED  CHARGE  PROBLEMS 


617 


5.  CONCLUSIONS 

We  have  presented  here  a  branch-and-bound  procedure  for  solving  linear  fixed  charge 
problems  that  uses  a  new  approach  to  bounding  on  the  fixed  charges  as  well  as  on  the  continu¬ 
ous  costs  while  not  requiring  a  knowledge  of  upper  bounds  on  the  continuous  variables.  Com¬ 
putational  results  for  this  procedure  have  been  presented  which  appear  to  be  significantly  better 
than  previous  algorithms  or  state-of-the-art  mixed  integer  codes. 

Future  research  in  this  area  would  include  testing  of  various  selection  rules  and  branching 
schemes  in  order  to  optimize  the  algorithm,  use  of  mass  storage  files  to  store  bases  for  partial 
solutions  and  testing  to  determine  what  makes  a  problem  "easy”  or  "difficult"  to  solve.  This 
procedure  could  also  be  specialized  for  fixed  charge  transportation  problems  by  using  network 
dual  pivoting  procedures  for  calculating  bounds. 
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ABSTRACT 

This  paper  discusses  the  properties  of  the  inventory  and  advertising  policy 
minimizing  the  expected  discounted  cost  over  a  finite  horizon  in  a  dynamic 
nonstationary  inventory  model  with  random  demand  which  is  influenced  by  the 
level  of  promotion  or  goodwill.  Attention  is  focused  on  the  relation  between 
the  fluctuations  over  time  of  the  optimal  policies  and  the  variations  over  time 
of  the  factors  involved,  i.e.,  demand  distributions  and  various  costs  The  op¬ 
timal  policies  are  proved  to  be  monotone  in  the  various  factors.  Also,  three 
types  of  fluctuations  over  time  of  the  optimal  policies  are  discussed  according  to 
which  factor  varies  over  time.  For  example,  if  over  a  finite  interval,  the  ran¬ 
dom  demand  increases  (stochastically)  from  one  period  lo  the  next,  reaches  a 
maximum  and  then  decreases,  then  the  optimal  inventory  level  will  do  the 
same.  Also  the  period  of  maximum  of  demand  never  precedes  that  of  max¬ 
imum  inventory.  The  optimal  advertising  behaves  in  the  opposite  way  and  its 
minimum  will  occur  at  the  same  time  as  the  maximum  of  the  inventory.  The 
model  has  a  linear  inventory  ordering  cost  and  instantaneous  delivery  of 
stocks;  many  results,  however,  are  extended  to  models  with  a  convex  ordering 
cost  or  a  delivery  lime  lag. 


1.  INTRODUCTION 

This  paper  focuses  on  the  problem  faced  by  the  manager  of  a  store  with  important  inven¬ 
tory  costs  and  an  advertising  budget  in  the  presence  of  fluctuations  in  economic  conditions.  In 
particular,  qualitative  results  are  provided  linking  fluctuations  in  economic  conditions  (demand, 
holding  costs,  advertising  costs)  to  fluctuations  in  jointly  managed  optimal  inventory  orderings 
and  advertising  levels.  Under  the  usual  convexity  assumptions,  knowledge  of  the  fluctuations 
of  the  demand  over  time  is  sufficient  to  determine  those  of  the  optimal  policy  without  comput¬ 
ing  them  explicitly. 

Very  few  authors  have  attempted  to  optimally  integrate  inventory  policy  with  advertising 
policy.  In  fact,  the  only  reference  we  know  of  is  Miercourt  (81.  He  established  the  existence 
and  the  characterization  of  the  optimal  policy,  though  he  did  not  attempt  to  discuss  their 
fluctuations  over  time.  The  general  approach  to  the  fluctuation  problem  is  along  the  lines  of 
Karlin's  work  15]  and  its  extensions  by  Veinott  [19],  [22],  Karlin  established  the  relationship 
over  time  of  the  optimal  inventory  policy  with  the  fluctuations  of  the  demand  for  a  discrete 
time  dynamic  inventory  model  involving  a  single  commodity  with  random  demand.  Veinott 
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extended  these  results  in  various  ways  emphasizing  refinements  that  are  possible  with  transla¬ 
tions  of  the  demand  distributions.  He  also  showed  that  the  proper  concept  linking  the  varia¬ 
tions  of  the  optimal  policy  over  time  with  the  fluctuations  of  the  demand  and  other  parameters 
is  the  myopic  policy.  A  proper  interpretation  of  seme  of  our  results  extends  the  work  of  Pier- 
skalla  {101  relating  optimal  inventory  policy  with  monotone  obsolescence  probabilities.  The 
work  of  Topkis  and  Veinott  [181,  Topkis  {171  and  Veinott  [261  on  subadditive  functions  on 
sublattices  and  its  application  to  inventory  problems  by  Veinott  [25],  provides  us  with  the 
proper  tools  and  methods  for  our  analysis.  A  knowledge  of  their  theory,  at  least  to  the  extent 
of  the  Appendix  of  Balcer  [31,  is  essential  to  understand  this  paper.  Results  from  this  Appen¬ 
dix  are  referred  to  by  the  letter  A  followed  by  a  number,  such  as  Theorem  A2,  Lemma  A1 
Example  A5.  Results  established  in  this  paper  are  referred  to  by  numbers,  such  as  Theorem  6 
or  Lemma  5. 

An  abundant  literature  (see  Balcer  (21)  covers  the  problem  of  optimizing  advertisement 
expenditures  given  a  known  demand-advertising  relationship  and  no  inventories.  Most  of  them 
generalize  the  model  of  Arrow  and  Nerlove  {11,  which  introduces  the  concept  of  goodwill 
increased  by  advertising  and  decaying  exponentially.  In  that  literature,  the  work  of  Kuehn  [7) 
stands  out  because  of  the  resemblance  of  some  of  his  results  with  ours.  His  model  is  not 
clearly  defined  and  is  said  to  be  inspired  by  a  previous  model  of  his  [6]  which  was  dynamic  with 
random  demand  (the  randomness  was  introduced  by  a  Markov  process  via  brand  switching). 
He  concluded  that  the  advertising  expenditure  fluctuates  over  time  in  unison  with  sales  and 
with  peaks  and  bottoms  of  advertising  expenditure  preceding  those  of  sales. 

As  mentioned  earlier,  our  main  focus  is  to  link  these  fluctuations  with  variations  in  pri¬ 
mary  factors  such  as  demand,  ordering  cost,  holding  and  shortage  costs,  promotion  effect,  etc. 
In  Section  2,  we  set  up  an  inventory  model  with  advertisement.  For  this  model  some  relevant 
results  from  Balcer  [3]  regarding  the  existence  and  the  characterization  of  the  optimal  policy  are 
reproduced. 

In  Sections  3  and  4  we  establish  that  the  optimal  solutions  in  a  given  period  are  monotone 
in  the  various  costs  of  that  period  or  future  ones.  In  Section  5  we  exhibit  the  relationship 
between  optimal  inventory  policies  which  take  into  account  all  present  and  future  costs, 
myopic  inventory  policies  which  take  into  account  only  present  costs.  In  Sections  6  and  7  tti^ 
fluctuations  in  the  optimal  inventory  and  advertising  policies  are  linked  to  the  fluctuations  in 
the  myopic  policies  which,  in  turn,  vary  with  changes  in  the  parameters  of  the  problem  such  as 
the  various  costs  and  the  demand.  The  problem  of  inventory  with  varying  obsolescence  proba¬ 
bilities  treated  previously  by  Pierskalla  is  discussed.  Finally,  in  Section  8,  the  results  of  the  pre¬ 
vious  sections  are  extended  to  models  with  convex  inventory  ordering  costs  or  with  lag  in 
delivery  of  the  ordered  inventory.  Sections  5,  6,  and  7  are  more  closely  related  to  Karlin’s 
work  and  Sections  3,  4,  5  and  8  to  Veinott’s. 

2.  MODEL 

In  this  paper  we  will  study  a  discrete  time  dynamic  model  of  single  commodity  manage¬ 
ment,  when  the  nonnegative  demand  for  the  commodity  in  each  period  is  uncertain  but  has  a 
known  distribution  depending  on  the  existing  level  of  goodwill.  At  the  beginning  of  each 
period,  the  manager  knows  the  initial  inventory  x,  the  present  and  future  demand  distributions 
and  the  cost  structure.  He  decides  to  increase  instantaneously  the  initial  inventory  x  to  a  start¬ 
ing  level  y  >  x  by  ordering  at  a  unit  cost  c,  and  goodwill  to  a  starting  level  b  ^  0  by  advertis¬ 
ing  at  unit  cost  p.  The  demand  gib)  +  U  is  the  sum  of  a  nonnegative  random  variable  U  and  a 
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nondecreasing  function  of  goodwill  such  that  giO)  =  0.  During  this  period,  the  demand 
gib)  +  U  occurs,  so  the  terminal  inventory  becomes  y  -  gib)  -  U.  The  initial  inventory  in 
the  next  period  is  yiy—gib)  —  U)  where  r?(z)  =  «+z+  -  tj_z~  with  19+  ^  n-  ^  0, 
z+  =  max(z.O)  and  z“  =  max(-z,  0).  When  the  slope  of  n  is  between  zero  and  one,  (1  —  rj+) 
is  interpreted  as  a  depletion  or  loss  of  inventory  and  ( 1  —  rj _ )  as  a  loss  of  sales  arising  from 
impatient  consumers  who  depart  before  receiving  their  orders.  The  initial  level  of  advertising 
in  the  next  period  is  0  since  the  effect  of  advertising  is  ephemeral. 

During  each  period,  the  manager  incurs  a  capacity  cost,  h ,  on  the  starting  inventory,  and  a 
shortage-holding  cost,  s,  on  the  terminal  inventory.  Both  of  these  cost  functions  are  convex, 
and  the  function  s  increases  to  infinity  with  its  argument.  Moreover,  s  is  nondecreasing  in  z  on 
the  nonnegative  real  line.  The  function  h  is  bounded  below.  There  is  a  unit  selling  price  r. 
The  current  sale  price  is  paid  when  each  consumer  demand  is  incurred.  This  yields  a  gross 
revenue  of  rigib)  +  U)  to  the  manager.  If  the  inventory  is  positive,  the  consumer  receives 
the  commodity  without  delay  until  either  the  demand  is  totally  satisfied  or  the  inventory  is 
completely  exhausted,  whichever  comes  first.  If  consumers  subsequently  depart  or  increase 
their  orders  without  being  served,  the  manager  refunds  or  pockets,  respectively,  the  then 
current  sale  price.  The  sale  is  final  only  when  the  consumers  receive  the  commodity  they  have 
purchased.  We  assume  that  EU  and  Esiy  -  gib)  —  U )  are  finite,  where  £  denotes  the  expec¬ 
tation  and  that  the  demand  function  gib)  is  increasing  and  concave  in  the  goodwill.  This  last 
assumption  has  been  verified  empirically  by  Shryer  [11]  and  Stone  [14]  for  mail  ordering, 
Telser  [151  for  cigarettes,  Palda  [9]  for  drugs,  Clement  et  al.  [4]  for  milk  and  Simon  [12],  [13] 
for  liquor. 

Given  a  finite  horizon  N,  let  C"(x)  be  the  minimum  expected  discounted  cost  in  the 
periods  n,  . . .  ,  N,  where  x  and  0  are  respectively  the  initial  inventory  and  goodwill  in  period  n. 
The  function  C"  can  be  calculated  for  each  period  by  the  dynamic  programming  recursion 

(1)  Cn(x)  =  min  (c(y  -  x)  +  pb  +  h iy)  4-  Esiv  -  gib)  -  U) 

vS.x7.50 

-  r{g(b )  +  U)  +  (1  -  r >_)r£(y  -  gib)  -  U)~ 

+  K  £Cn+1  fr)  (y  —  gib)  -  t/)]}. 

for  n  =  1 . N,  where  C,v+I(x)  =  -  cx.  Every  symbol  in  (1)  should  be  indexed  by  n; 

however,  when  no  confusion  is  possible  because  of  the  context,  the  index  n  is  suppressed 
throughout  this  paper. 

It  is  convenient  first  to  transform  the  problem  to  one  with  no  ordering  costs,  a  technique 
which  has  been  used  by  Veinott  [20].  This  is  achieved  by  letting  C"ix)  —  C"ix)  +  cx. 
Second,  the  problem  is  not  convex  in  iy.  b ),  but  it  is  in  iy.P)  where  0  -  gib),  the  impact  of 
advertising  on  sales.  Because  gib)  is  increasing  in  b,  the  optimal  policy  in  the  (,v. / 3)-space  can 
be  expressed  in  the  original  (y.  b)- space.  After  the  two  transformations,  recursion  (I)  becomes 

(2)  Cnix)  =  min  )W(y)  -F  Pip)  +  Siv  -  p)  +  X  EC^'bl  (v  -  P  -  U))). 

t  S  x.0  5  0 

where 

(3)  Hiz)~  cz  +  hiz), 

(4)  Piz)  -  wr'(z)  -  rz 
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and 

(5)  S(z)  =  E[s(z  -  U)  -kc'-qiz  -  U)  +  (1  -  r ,_)r(z  -  U)~ I. 

The  functions  H \  P ,  and  S'  are  respectively  the  aggregate  inventory  ordering-capacity  cost,  the 
aggregate  advertising-revenue  cost  and  the  aggregate  shortage-holding  cost.  The  prime  indi¬ 
cates  that  the  parameter  is  from  the  following  period.  Also,  a  constant  term,  —rEU,  has  been 
omitted  as  it  clearly  does  not  affect  the  choice  of  the  optimal  policy.  Note  now  that 
CiV  +  1(x)  =  0.  For  a  detailed  discussion  of  these  transformations,  see  Balcer  12)  or  13). 

In  Balcer  [31,  under  the  mild  condition  that  S  is  convex  at  0, 

(1)  D*s( 0)  +  r  —  kc'-q+  >  tj_  (r  —  kc')  +  D~s(0). 

where  D+s( 0)  (D~  s( 0))  indicates  the  right  (left)  derivative  of  s  at  zero  and  some  weak  condi¬ 
tions  at  infinity, 

(II. a)  +  D~S(o°)  >  o, 

(11. b)  D~  H(°°)  +  D~P(°°)  >  0, 

di  e)  d  p< °°)  -  zrs<- °°)  >  o, 

(ll.d)  D*H(—°°)  +  D*S(—°°)  <  0, 

it  is  shown  that  C'Tx)  is  convex  and  nundecreasing  in  x  and  that  a  finite  optimal  policy  exists. 
Moreover,  it  is  shown  that  the  minimand  in  (2)  is  subadditive  in  {y.p)  and  that  the  set  of  all 
optimal  policies  for  (2)  is  a  nonempty  compact  sublattice.  For  sake  of  definiteness,  the  optimal 
policy  (y(x)./3(x))  is  chosen  to  be  the  lexicographically  least  element  of  that  sublattice. 

From  here  on  (y"(x),  /3"(x))  denotes  the  optimal  policy  in  period  n  whose  components 
are  the  optimal  inventory  and  the  optimal  controlled  demand  and  which  minimizes  the  right- 
hand  side  of  (2).  The  smallest  solution  to  the  minimization  of  the  total  costs  as  given  by  (2) 
when  only  one  of  the  two  variables  can  be  chosen  and  when  that  variable  is  unconstrained  is 
denoted  by  yn(a)  andf3"(x),  respectively,  (a  is  an  arbitrary  value  taken  by/3  when  the  minim¬ 
ization  is  done  over  y  only.)  Also,  the  solution  to  the  minimization  of  the  total  costs  in  period 
n  as  described  by  (2)  when  the  two  variables  are  unconstrained  is  Cv*",  P  *").  This  defines  the 
base  stock  level  in  period  n  (as  we  shall  see  below)  whose  components  are  the  base  inventory 
level  and  the  base  controlled  demand.  The  differences,  y”(x)  -  /3"(x)  and  y*"  -  (3  are  the 
optimal  net  inventory  and  the  base  net  inventory  in  period  n,  rewritten  z"(x)  and  z*",  respec¬ 
tively.  Finally,  the  solution  to  the  minimization  of  the  total  present  costs  in  period  n  as  given 
by  the  right  hand  side  of  (2)  with  the  last  term  omitted  is  the  myopic  policy.  In  the  preceeding 
sentences,  if  we  replace  superscript  n  by  subscript  n  we  have  the  myopic  counterparts  of  the 
optimal  solutions.  For  example,  z*  is  the  myopic  base  net  inventory  in  period  n.  In  the  sequel, 
as  mentioned  earlier,  the  superscript  n  is  omitted  when  no  confusion  is  possible  because  of  the 
context. 

Finally,  the  following  theorem,  which  parallels  similar  results  of  Veinott  [211  for  a  mul¬ 
tiproduct  inventory  model,  holds: 
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THEOREM  1:  Under  the  above  conditions,  y(x)  =  x  V>,(/3*)  and  /3(x)  -  /§  O'*  V  x) 
where  y  and  /3  are  nondecreasing  on  their  respective  domain. 

The  symbol  V  denotes  the  maximum  of  the  two  elements  on  each  side  of  it.  In  addition, 
if  this  additional  condition  holds, 

(III)  D~H( 0)  +  D"S(0)  <  0, 

the  optimal  policy  is  nonnegative  and  n  does  not  need  to  be  convex. 

3.  MONOTONICITY  OF  THE  OPTIMAL  SOLUTION  IN  THE  AGGREGATE  COSTS 

In  this  section,  we  will  describe  the  changes  in  J"(x),  /3"(x)  and  z"(x)  when  changes 
occur  in  the  aggregate  costs  H,  P  and  S.  The  importance  of  this  and  the  following  section 
results  from  the  ability  to  predict  changes  in  the  optimal  strategy  when  some  underlying  condi¬ 
tions  are  changing  without  having  to  recompute  the  optimal  policy.  As  one  is  unfamiliar  with 
these  aggregate  costs,  the  results  of  this  section  will  not  be  as  intuitive  as  those  of  the  next  sec¬ 
tion  which  are  in  terms  of  the  primary  costs  and  the  demand  distributions.  We  prove  all  the 
monotonicity  results  with  the  aggregate  costs  as  the  proofs  are  sharper  and  the  results  specialize 
easily  to  the  primary  costs. 

Before  proceeding,  additional  notations  and  a  lemma  on  ordered  sets  of  functions— to  be 
defined  below— have  to  be  set.  The  class  ft  of  all  real- valued  functions  on  R  can  be  quasi- 

ordered  by  the  incremental  order  defined  as  follows.  First,  write  id  ~  id'  on  R  if  id  -  id'  is  con¬ 

stant  on  R.  Evidently  —  is  an  equivalence  relation  on  ft .  Now,  id  is  incrementally  smaller 
than  w'  €  ft  written  w  ^  id',  if  for  all  v  <  w  €  R,  u>(w)  -  <u(v)  ^  w'(w)  -  w'(v),  if  id  and 
id'  are  differentiable,  it  implies  that  0«o(v)/0v  <  0<o'(v)/0v  for  all  v  6  R. 

LEMMA  1:  If  0^  ft'  C  fi  and  fi '  is  a  chain,  then  o>(v)  is  superadditive  in  ( id.v )  on 
n'  X  R. 

PROOF:  Suppose  (<u.v).  (<u',  v')  €  ft'  x  R  are  incomparable.  Since  O'  and  R  are  chains, 
we  may  suppose  to  ^  id'  and  v  >  v\  Then  (to  A  to’)  (v  A  v')  +  (to  V  to’)(v  V  v')  =  to(v')  + 

to'(v)  ^  to'(v')  +  to  (v)  by  definition  of  <,  completing  the  proof. 

If  id"  and  id'"  are  sequences  of  real-valued  functions,  i.e.,  to"==  (to„ . toA.)  and  a>’"  = 

Wfl ,  ,  «;),  write  to"  <  to'"  if  to,  ^  to/  for  i  —  n . /V.  If  ft  is  a  quasi-ordered  set,  its 

dual,  ft  *,  contains  the  same  elements  with  the  ordering  reversed.  In  other  words,  if  to  and  to' 
belong  to  ft  and  to  <  to',  then  to  and  to'  belong  to  ft  *  and  to'  ^  to. 

Denote  Jt„,  and  the  classes  of  all  real  valued  continuous  convex  functions  H„ ,  Pn 
and  S„,  respectively,  satisfying  condition  (II).  Denote  by  and ,  respectively,  given 

chains  in  % and  9n.  The  sets  ft",  9",  /",  ft'”,  9'",  and./'"  are  defined  similarly.  In  the 
sequel,  it  is  understood  that  these  sets  are  ordered  by  the  incremental  order.  In  order  to  exhi¬ 
bit  the  dependence  of  y"(x)  and  y„(x),  say,  on  the  aggregate  costs,  we  write  yfxtto")  and 
y  (x;  id „)  instead  of  y"(x)  and  y„(x),  respectively,  where  id  can  be  either  H ,  P  or  5.  In 
Theorems  2,  3  and  4  to  follow,  all  parameters  of  the  problem  are  held  fixed  but  for  the  one  so 
specified  in  the  theorem.  Changes  in  the  optimal  policies  are  linked  to  the  variations  of  the 
aggregate  ordering-capacity  cost  H,  the  aggregate  advertising-revenue  cost  P,  and  the  aggregate 
shortage-holding  cost  S.  The  next  four  theorems,  particularly  Theorem  5,  are  an  extension  of  a 
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result  and  a  proof  of  Veinott  [23!  for  the  impact  of  future  demands  on  current  optimal  inven¬ 
tory  level;  this  result  was  subsequently  extended  by  Topkis  [16]  for  the  impact  of  future  cost 
functions  characterized  by  a  single  parameter  on  current  optimal  inventory  level. 

THEOREM  2:  For  all  n,  y(x,H")  and  z(x,H")  are  nonincreasing  in  Hn  on  /3(x;  H") 
is  nonincreasing  in  Hn  and  nondecreasing  in  Hn+'  onSf",  p{x,H")  is  independent  of  Hn  and 
nondecreasing  in  Hn  +  1  on  St",  and  C(x,Hn)  is  superadditive  in  (x,  H,)  on  R  *  for 

/  -  n . N. 

PROOF:  See  Appendix. 

THEOREM  3:  For  all  n ,  y(x\  Pn )  is  nonincreasing  in  P"  on^",  4  [a;  P")  and  4 (x;  P")  are 
nonincreasing  in  P„  and  nondecreasing  in  Pn+X  on  0>n\  z(x\Pn)  is  nondecreasing  in  P„  and 
nonincreasing  in  P"+ 1  on  0>n,  and  C(x,P" )  is  superadditive  in  (x,  /»)  on  R  x  for 
i  -  n . N. 

PROOF:  See  Appendix. 

THEOREM  4:  For  all  n.  y(x\Sn)  and  z(x;Sn)  are  nonincreasing  in  Sn  on /3(x;S") 
and4(x;S”)  are  nondecreasing  in  S"  also  C(x;  Sn)  is  superadditive  in  (x.  SJ  on  R  x if] 

for  t  =  n . N. 

PROOF.  See  Appendix. 

In  Theorems  2,  3  and  4  we  establish  that  an  increase  in  any  present  and  future  aggregate 
costs  lead  to  a  lower  optimal  level  of  inventory  on  hand  both  before  and  after  adverstising. 
This  is  to  be  expected,  as  the  cost  of  capacity  goes  up,  less  inventory  is  purchased  in  anticipa¬ 
tion  of  sales;  also,  as  the  cost  of  advertising  goes  up,  less  advertising  will  take  place,  thus  less 
inventory  is  needed;  and  finally,  as  the  holding  cost  function  increases,  less  inventory  is  pru- 
chased,  so  less  will  remain  at  the  end  of  the  period,  thus  reducing  the  holding  costs.  The  situa- 
tion  is  a  bit  more  delicate  for  the  optimal  level  of  advertising.  Recall  that  b(x)  -  g_1(/3(x)), 
where  g  1  is  monotone  increasing.  When  capacity  cost  increases  in  the  present  period, 
advertising  is  reduced  as  the  cost  increase  has  led  to  a  reduced  inventory  on  hand  to  be  sold; 
but  when  capacity  costs  increase  for  the  future  periods,  advertising  is  expanded  in  the  current 
period  to  increase  current  demand  and  therefore  reduce  the  initial  inventory  level  in  future 
period.  Obviously,  as  advertising  cost  increases  in  the  current  period,  the  amount  of  advertis¬ 
ing  is  reduced;  however,  as  advertising  costs  increase  in  subsequent  periods,  current  expendi¬ 
tures  on  advertising  will  increase  as  the  total  advertising  budget  is  shifted  in  favor  of  the 
current  period  which  is  thus  made  relatively  cheaper.  Finally,  as  the  holding  cost  functions 
increase  in  either  present  or  future  periods,  it  results  in  increased  advertising  to  generate  more 
sales,  and  to  reduce  the  terminal  inventory  level,  thus  reducing  holding  costs. 

The  results  of  the  last  three  theorems  are  summarized  in  Table  1  for  comparative  pur¬ 
poses.  A  plus  indicates  that  the  solution  is  nondecreasing  in  the  aggregate  cost  and  a  minus 
indicates  that  the  solution  is  nonincreasing  in  the  aggregate  cost.  For  example,  Fix)  is  nonde¬ 
creasing  in  P„  and  nonincreasing  in  S". 

4.  MONOTONICITY  OF  THE  OPTIMAL  POLICY  IN  THE  PRIMARY  COSTS 

AND  THE  DEMAND 

In  this  section,  we  apply  the  results  of  Section  3  to  obtain  the  monotonicity  properties  of 
the  optimal  policy  in  the  costs  and  the  basic  demands.  The  results  of  this  section  tell  us  what 
happens  to  the  optimal  inventory  level  F(x;-n),  the  optimal  controlled  demand  fiix,  ■ ")  thus  to 
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TABLE  1  —  Monotonicity  of  the  Optimal  Policies  in  Aggregate  Costs 


Ordering-Capacity 

Advertising-Revenue 

Holding-Shortage 
Present  and  Future 

Future 

H"+\ 

Present 

Pn 

Future 

p/l  +  I 

S" 

Inventory  Level 

J(x;  •") 

_ 

_ 

— 

— 

Net  Inventory  Level 
z(x;  • ") 

_ 

+ 

_ 

— 

Advertising  Level 
b-x\  •")  =  g-1</3  (x;  •")) 

_ 

+ 

— 

+ 

+ 

the  optimal  advertising  level  b(x\'")  and  to  the  net  inventory  level  z(x;  • ")  when  any  primary 
factors  change  either  in  the  current  or  in  a  future  period.  Note  that  when  a  factor  is 
represented  by  a  function,  an  increase  in  the  factor  means  that  it  is  now  represented  by  a  func¬ 
tion  which  is  greater  than  the  previous  one  according  to  the  incremental  order. 

Before  linking  the  variations  of  the  aggregate  costs  to  those  of  the  primary  costs  and  thus 
predicting  the  variations  of  the  optimal  policies,  we  shall  establish  some  results  for  the  varia¬ 
tions  in  the  demand  distribution.  The  set  &  of  all  demand  distributions  on  the  nonnegative  real 
half  line  can  be  partially  ordered  in  a  natural  way  by  a  stochastic  order  defined  as  follows.  We 
say  F  €^is  stochastically  smaller  than  G  €  #  written  F  C  G,  if  F(u)  >  G(u)  for  all  real  u. 

Let^"  be  the  N  -  n  +  1-fold  cartesian  product  of  Then  we  say  F"  =  (F„ . FN)  is 

stochastically  smaller  than  G"  =  (G„ . G*)  €^",  written  F"  C  G",  if  F,  C  G,  for 

/  -  n . N.  In  the  sequel  it  is  understood  that£r'1  is  always  partially  ordered  by  stochastic 

order. 

THEOREM  5:  For  all  n,y(x\F")  and  z (x ; F")  are  nondecreasing  in  F"  on  0 (x; F") 
and  P  (x;  F")  are  nonincreasing  in  F"  or  also,  C  (x ;  Fn)  is  subadditive  in  (x,  Ft)  on  R  * 
for  I  —  n . N. 

PROOF:  We  assert  that  F"  C  F'n  in  implies  that  S"  >  S'"  in  if",  where  S  and  S'  are 
defined  by  equation  (5).  To  see  this,  for  i  ^  n,  let  G,'-  F'~i'(V)  and  U,  -  F~HV)  where  Kis 
a  uniform  random  variable  on  [0,  1],  By  the  stochastic  ordering,  F,“*  ( V)  ^  F^H  V)  for  all 
values  of  V.  Thus,  by  convexity  of  S,  if  v  <  v',  then  for  all  z  <  z',  S(z'- v)  -  S(z  -  v)  ^ 
S(z'-  v')  -  S(z  -  v').  Replacing  v  and  v'  by  Ft~ 1  ( F)  and  F^H  V),  respectively,  we  conclude 
that  Sj  ^  SI  for  all  /,  thus  S"  ^  S'"  in  <f".  This  completes  the  proof  of  the  assertion.  In  the 
proof  of  Theorem  4,  KECiviz  ~  U)\  5'l+l)  is  a  function  of  F„  but  not  of  S„.  However,  since 
C"+1  (tj (x ) )  is  convex  nondecreasing  in  x,  X£C(t)(x  -  U))  is  convex  nondecreasing  in  x,  then 
by  Lemma  A2,  \£C(tj(z  -  G);Sn+1)  is  subadditive  in  (z,  F„).  With  this  modification,  the 
proof  carries  over  implying  the  results. 

So,  as  present  or  future  expected  demands  increase  (F  stochastically  smaller  than  G 
implies  that  the  expected  demand  based  on  Fis  smaller  than  the  one  based  on  G),  the  optimal 
level  of  inventory  should  increase.  Surprisingly,  as  present  or  future  demand  goes  up,  the 
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optimal  advertising  level  decreases.  This  leads  us  to  believe  that  advertising  is  done  to  compen¬ 
sate  for  the  lack  of  demand  in  the  market,  though  we  cannot  conclude  that  advertising  expendi¬ 
tures  are  countercyclical  to  market  demands.  This  analysis  will  be  pursued  in  Section  7  were 
fluctuations  over  time  are  investigated.  But  why  are  advertising  expenditures  decreasing  when 
the  demand  picks  up?  Since,  in  the  present  model,  total  demand  is  a  sum  of  a  stochastic 
demand— a  component  that  varies  exogenously— and  a  controlled  demand  which  depends  on 
the  advertising  expenditures,  advertising  is  not  more  or  less  effective  in  generating  demand 
when  the  stochastic  component  of  demand  is  great  or  small.  At  first,  it  appears  that  advertising 
expenditures  should  be  constant,  but  if  we  think  of  the  controlled  demand  as  demand  that  is 
purchased  at  a  price,  its  price  goes  up  with  an  increase  in  stochastic  demand.  This  is  so, 
because  at  higher  stochastic  demand,  holding  advertising  expenditures  constant,  more  demand 
is  generated,  more  inventory  is  needed  and  the  capacity  cost  of  the  last  unit  increases  with 
inventory,  making  the  last  unit  of  advertising  expenditures  unprofitable.  Therefore,  as  stochas¬ 
tic  demand  goes  up,  advertising  expenditures  are  curtailed. 

LEMMA  2:  For  all  n,fi"(x)  is  nondecreasing  in  r„  forv~  <  1 

PROOF:  In  equations  (4)  and  (S),  the  mixed  partial  derivative  of  P(0)  with  respect  to/J 
and  r„  is  —1,  the  mixed  partial  derivative  of  S(y  —  ft)  with  respect  to  y  and  rn  is 
-0  -  r)_)F(y  -  0),  where  F(z)  -  _1  -  F(z),  and  the  mixed  partial  derivative  of  SCv  ~ 
with  respect  to  /3  and  r„  is  (1  -  y~)F(y  -  &).  Therefore,  since  S(y  -  fi)  is  convex,  the  right 
hand  side  term  in  (2)  is  subadditive  in  (y,  1 3,  r„)  by  example  A6  and  Lemma  Al.  By  Theorem 
A2,  the  least  element,  ( y(x\rn ),  /3(j r,rn)),  minimizing  the  right-hand  side  term  in  (2)  is  non¬ 
decreasing  in  /•„.  This  completes  the  proof. 

Using  (3),  (4)  and  (5)  the  variations  in  the  primary  costs  result  in  variations  of  the 
appropriate  aggregate  costs  and  Theorems  2,  3,  4,  or  Table  1,  Theorem  5  and  Lemma  2  apply. 
All  the  results  regarding  variations  of  the  optimal  policies  are  reported  in  Table  2.  Certain 
parameters  have  more  complex  variations  and  no  result  on  optimal  policies  can  be  ascertained 
for  their  variations  in  future  periods.  However,  by  restricting  ourselves  to  the  current  period, 
variations  of  the  myopic  policies  can  be  linked  to  those  of  the  primary  costs.  Again,  using  (3), 
(4)  and  (5)  with  Theorems  2,  3  and  4,  variations  of  the  myopic  policies  are  reported  in  Table  3 
for  the  primary  costs  when  no  results  for  the  corresponding  optimal  policies  were  obtainable. 
The  convention  used  in  Tables  2  and  3  is  identical  to  the  one  in  Table  1. 

TABLE  2  —  Monotonicity  of  the  Optimal  Policies  in  the  Primary  Factors 


Demand 

Distribution 

Capacity 

Cost 

Advertising 

Cost 

Holding 

Shortage 

Cost 

Revenue 

per 

Unit  Sold 

Present 

Future 

Present 

?<*■'>„ 

Future 

i  —  l  \lt+  1 

pis  > 

F* 

h 

n  <  1  n  “  1  n  >  1 

Inventory  Level 

T<x;-"> 

+ 

_ 

_ 

_ 

_ 

_ 

+  +  ± 

Net  Inventory  Level 
z(x;  •*) 

+ 

_ 

+ 

— 

— 

±  —  — 

Advertising  Levs.1 
5(x;-’)-*''(8  <*;•’» 

— 

- 

+ 

— 

+ 

+ 

+  +  + 
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TABLE  3  —  Monotonicity  of  the  Myopic  Policies  in  the  Primary  Factors 


Discount 

Rate 

Unit  Ordering 
Cost 

Inventory 

Survival 

Rate 

Demand 

Backlogged 

Rate 

Present 

Next 

c<.+i 

V  +  n 

r>.Kc  r-kc  r  <kc 

Inventory  Level 

+ 

_ 

+ 

+ 

—  invariant  + 

Net  Inventory  Level 
2(x;-„) 

+ 

_ 

+ 

+ 

—  invariant  + 

Advertising  Level 
b(.x\-n)  -  g“'0(x;-„)) 

_ 

_ 

_ 

+  invariant  — 

The  interpretation  of  the  r  suits  reported  in  Table  2  for  the  variations  in  the  stochastic 
demand,  F",  have  been  discussed  above  following  Theorem  5.  The  next  four  entries  on  capa¬ 
city  costs  and  advertising  costs  can  be  analyzed  along  the  lines  of  their  corresponding  aggregate 
costs  H  and  P  (see  Section  3).  The  holding-shortage  cost  function,  s,  is  more  interesting  as  it 
models  two  costs:  when  the  terminal  inventory  is  positive,  it  is  the  cost  of  holding  the  inven¬ 
tory  and  storing  it  and  when  the  terminal  inventory  is  negative,  it  is  the  imputed  cost  for  hav¬ 
ing  unsatisfied  customers  (akin  to  the  penalty  for  not  fulfilling  the  terms  of  a  contract  or,  as  a 
modeling  short  cut,  it  is  a  proxy  to  incorporate  the  loss  resulting  from  a  reduction  in  future 
demands  due  to  bad  service).  For  simplicity,  let  s(z)  =  s+z+  +  s_z~  where  s+,  $_  >  0;  if  s'  is 
greater  than  s,  then  s+  >  s+  and  si  <  s_.  So,  an  incrementally  larger  holding-shortage  cost,  s, 
means  a  larger  holding  cost,  s>,  and  a  smaller  shortage  cost,  s_.  With  this  in  mind,  the  expla¬ 
nation  given  in  Section  3  for  the  variations  of  the  optimal  policies  in  terms  of  S  carries  directly. 
Finally,  if  the  sale  price  in  the  current  period,  r„,  increases,  it  is  natural  to  increase  advertising 
expenditures  as  they  are  more  profitable  and  to  increase  inventory  to  close  these  sales  quickly. 
This  is  not  true  when  v~  >  1,  this  is  not  too  surprising  because,  in  this  case,  as  the  customers 
are  waiting  for  their  order,  they  increase  the  size  of  their  order  (like  hoarding  or  panic-buying 
by  consumers  in  face  of  shortages).  Thus,  by  possibly  having  lower  inventory  on  hand,  the 
increased  demand  by  unserviced  customers  may  more  than  offset  the  penalty  cost. 

In  Table  3,  a  decrease  in  the  cost  of  money  (an  increase  in  the  discount  rate  \)  fosters  an 
increase  in  the  inventory  level  as  inventories  become  cheaper  and  also,  because  inventories  are 
cheaper  to  hold,  sales  through  advertising,  i.e.,  sales  that  are  purchased  to  a  certain  extent  by 
the  seller,  are  reduced.  Finally,  adjustments  in  the  inventory  levels  to  change  in  unit  ordering 
costs  for  the  present  and  the  next  period  follow  a  predictable  pattern:  today’s  price  goes  up, 
postpone  the  purchases;  tomorrow’s  price  goes  up,  build  up  the  inventory  now. 

5.  FLUCTUATIONS  OVER  TIME  OF  THE  OPTIMAL  SOLUTION 

IN  THE  MYOPIC  POLICIES 

In  this  section,  we  will  show  that  the  fluctuations  of  the  optimal  policies  over  ,^me  are 
linked  to  the  variations  of  the  myopic  policies.  The  variations  of  the  myopic  policies  are  taken 
as  given,  no  attempt  is  made  in  this  section  to  link  them  to  variations  of  the  parameters. 
Before  studying  those  fluctuations,  we  must  prove  two  preliminary  lemmas.  First,  establishing 
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a  sufficient  condition  for  the  base  optimal  policies  to  equate  the  base  myopic  policies.  Second, 
ranking  the  base  optimal  policies  with  respect  to  the  base  myopic  policies.  Let  F~  1  (0)  be  the 
infimum  of  those  zsuch  that  F„(z)  >  0. 

LEMMA  3:  lf7?(z*"-  F~H0))  <  theny,n  =  y*  and/3  P*„. 

PROOF:  Since  V  3*  Fn  '(0)  and  y  is  nondecreasing,  y  (ztn  -  U)  <  ytn+'  for  all  values  of 
U.  Thus,  the  point  (y*H.  /3  *")  is  in  the  interior  of  the  half-space  delimited  by 
ti(y  -p  -  F~H  0))  y*"+1,  and  in  that  half-space  EC+,(y(y  ~  P  -  U))  is  constant.  There¬ 
fore,  y*n=  y *  and  /3  •"  =  /3  J,  completing  the  proof. 

LEMMA  4:  Always,  ymn  <  y*  mdP"(x)  >  P„(x)  for  all  x. 

PROOF:  In  equation  (2),  both  S(y  - ) 3)  and  KECn+'  [y(y  -  P  -  (/)]  are  functions  of 
y~P  alone.  In  addition,  C'I+1  is  nondecreasing  on  R;  thus  S'(z)  =  S(z)  4- 
A  ECn+l[y(.z  -  U)  1  is  incrementally  larger  than  S(z).  Hence,  by  Theorem  4,  the  result  follows. 

Lemmas  3  and  4  have  linked  the  myopic  policy  and  the  optimal  policy  in  the  same  period. 
The  next  three  corollaries  establish  a  relationship  between  optimal  policies  in  different  periods. 
To  obtain  simple  results,  we  must  define  a  new  condition  on  : 

(IV)  Either  y*  ^  0  and  y  +(I  <  1  for  all  n,  or  y  +„  =  y  -  „  =  1  for  all  n. 

The  nonnegativity  of  y*  can  be  insured  by  Condition  (III)  or  the  restriction  y  >  0  on  the 
minimizing  space  in  (2). 

COROLLARY  1:  If  Condition  (IV)  holds  and  y*  >  yj+1,  then  yn(x)  >  Jn+1(x)  for  all  x. 

PROOF:  See  Appendix  Corollary  1'. 

COROLLARY  2:  If  Condition  (IV)  holds  and  either  y*  ^  yj+1  and  y*n  >  y*"+\  or 
y *  <  y;+l  and  y*n  >  y*n+1,  then  >"+,(x)  3*  J"+2(x)>-for  all  x. 

PROOF:  See  Appendix  Corollary  2'. 

COROLLARY  3:  If  Condition  (IV)  holds  and  y*H  <  yj,  then  y"(x)  >  y"+l(x)  for  all  x. 

PROOF:  Assume  there  exists  x  such  that  y"(x)  <  y"+1(x).  Because  J"(x)  -  y*n  V  x, 
y*«  <  y*n+'.  By  Lemmas  2  and  3,  since  y(y*"  -  F”'(0))  <  y*"+1,  y*"=  y*„  contradicting  the 
assumption.  This  completes  the  proof. 

These  three  Corollaries  are  the  building  elements  of  our  analysis  of  the  impact  on  optimal 
policies  of  the  variations  over  time  in  the  myopic  policies.  Corollary  1  establishes  that  a  decline 
of  the  myopic  policies  from  one  period  to  the  next,  implies  a  decline  of  the  optimal  policies. 
Corollary  2  states  that,  if  myopic  policies  increase  from  one  period  to  the  next,  while  the 
optimal  policies  decline,  then  the  optimal  policies  must  also  decline  at  a  later  time.  Finally, 
from  Corollary  3,  we  get  that  if  an  optimal  policy  is  strictly  smaller  than  the  corresponding 
myopic  policy,  then  the  optimal  policy  in  the  next  period  is  no  greater  than  the  present  one. 
We  summarize  these  results  in  a  very  simple  theorem.  To  that  end,  we  partition  time  into 
intervals  T,  on  each  of  which  y  *  is  unimodal  in  n. 


NAVAL  RESEARCH  LOGISTICS  QUARTERLY 


VOL.  28.  NO.  4,  DECEMBER  1981 


FLUCTUATIONS  OF  OPTIMAL  INVENTORY  POLICY 


629 


THEOREM  6:  If  Condition  (IV)  holds  and  y*  is  unimodal  in  n  on  an  interval  T,  then 
J"(x)  is  unimodal  in  n  on  T.  Moreover,  the  least  mode  m'  of  y*n  on  T  does  not  exceed  the 
least  mode  m  of  y*  on  T.  Finally,  y*n—  y*  for  n  <  m'  on  T. 

PROOF:  Let  m  be  the  least  mode  of  y *  on  T.  The  assertion  is  true  if  y*n  -  y *  for  all 
n  €  T.  If  not,  there  exists  a  least  it  6  T  such  that  y*k  <  y*k.  If  k  >  m.  Corollary  1  implies 
that  m'  equals  m.  If  k  <  m,  by  Corollary  3,  yj+|  >  y*k  >  y*k  ^  y**+1.  Applying  Corollary  3 
successively  for  k . m  -  1 ,  one  sees  m '  equals  k  or  k  -  1 .  If  k  -  m,  the  result  is  immedi¬ 

ate,  completing  the  proof. 

Theorem  6  is  a  natural  extension  of  Veinott’s  [24]  original  work  linking  the  optimal  poli¬ 
cies  to  the  myopic  policies  over  time  in  the  context  of  a  one  commodity  inventory  model.  This 
theorem  says  that  when  y J,  the  myopic  policies,  are  unimodal  over  time,  the  optimal  policies 
y J  are  unimodal.  Moreover,  the  mode  of  y*  never  precedes  the  mode  of  y*"\  when  increasing, 
the  optimal  policies  are  equal  to  the  myopic  policies  possibly  with  the  exception  of  the  last  one; 
and  when  optimal  policies  decline  in  one  period,  they  decline  from  that  moment  on  in  the 
interval  T,  i.e.,  until  at  least  the  moment  the  myopic  policies  increase  again  after  having  them¬ 
selves  declined.  All  these  results  have  been  expressed  in  terms  of  y*,  but,  by  Theorem  1,  they 
extend  directly  to  y"(x)  —  y*n  Vx  The  results  are  illustrated  in  Figure  1,  where  the  graph  of 
y*  is  represented  by  the  dashed  line,  the  graph  of  y*n  by  the  solid  line  and,  when  there  is  no 
solid  line,  by  the  dashed  line.  The  graph  of  yH(x)  is  represented  by  the  dotted  line  and,  when 
there  is  no  dotted  line,  by  the  graph  of  y*n.  One  situation  is  not  illustrated  on  the  graph:  the 
case  when  both  the  modes  of  y *"  and  y*  coincide  in  time  with  possibly  y*n  -  y*. 


Inventory 


Time  or  period 


Figure  I.  Fluctuations  of  the  optimal  inventory  policies  y"(x)  and  base  inventory  level 
y*"  as  a  function  of  the  variations  of  the  myopic  policies  y’„  over  time. 
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Similar  results  are  derived  for  the  optimal  net  inventory  policy  which  is  the  difference 
between  optimal  inventory  policy  and  optimal  controlled  demand  generated  by  advertising.  The 
importance  of  deriving  results  for  the  net  inventory  level  will  be  perceived  in  the  next  section, 
as  they  carry  additional  information  about  the  fluctuations  of  the  optimal  policies  over  time. 
First,  a  technical  lemma  is  proved  followed  by  an  adapted  version  of  Corollaries  1 ,  2  and  3  and 
Theorem  6. 

LEMMA  5:  If  v-„  “  0,  yj+i  -  0,  and  z*"  <  z*"+1,  then  y*"  -  y*  and|3  /8  J. 

PROOF:  Since  z*n  <  z*n+l  —  -/3  •n+1  <  0,  the  point  O'*".  /3  •")  is  in  the  interior  of  the 
half  space  {y  - /3  <  0}.  And  on  that  half  space  £C"+1(T)„(y  -  0  -  U))  is  constant  because 
t)  -  0.  Therefore,  y*"  -  y *  and  /3  •"  -  /3  *  by  the  convexity  of  the  three  first  terms  in  (2)  com¬ 
pleting  the  proof. 

COROLLARY  4:  If  Condition  (IV)  holds  and  z*  >  *•+,,  then  z*"  >  z*"+l. 

PROOF:  Assume  the  contrary,  i.e.,  z*"  <  z*"+l.  By  Condition  (IV),  there  are  three  pos¬ 
sibilities,  viz.  (i)  t)+B  -  -  1,  (ii)  y*"  +  l  ^  0  and  y*"+1  V  r >  0  and  (iii)  y*"+1  -  0  and 

Tj_„  -  0.  In  the  first  two  cases  t )„(z*")  <  y*"+l  so  z*"-  z*  by  Lemma  3.  In  the  third  case 
z*"-  z*  by  Lemma  5.  Thus,  by  Lemma  4,  in  all  cases  z*"“  zj  ^  zJ+)  >  z*"+l,  which  is  a 
contradiction  and  completes  the  proof. 

COROLLARY  5:  If  Condition  (IV)  holds  and  z*"  <  zj,  then  z*”  >  z*"+l. 

PROOF:  Assume  the  contrary,  i.e.,  z*n  <  z*"  +  l.  It  follows  from  Condition  (IV),  as  in 
the  proof  of  Corollary  4,  that  z*"=*  zj,  which  is  a  contradiction  and  completes  the  proof. 

COROLLARY  6:  If  Condition  (IV)  holds  and  if  either  zj  <  zj+i  and  z**  >  z**+l  or 
z*  <  zj+|  and  z*"  >  z*"+l,  then  z*"+1  >  z*B+2. 

PROOF:  Assume  the  contrary,  i.e.,  z*B+l  <  z*"+2.  Then  from  Condition  (IV),  as  in  the 
proof  of  Corollary  4,  z*"+l  -  z$+,.  Hence,  by  Lemma  4,  z*"+1  -  z*+\  ^  z*  >  z*"  ^  z*H+l 
with  either  the  first  or  third  inequality  being  strict.  This  is  a  contradiction  and  completes  the 
proof. 

THEOREM  7:  If  Condition  (IV)  holds  and  z *  is  unimodal  in  n  on  T,  then  z*"  is  unimo- 
dal  in  n  on  T.  Moreover,  the  least  mode  m  "  of  z*"  on  T  does  not  exceed  the  least  mode  of  z* 
on  T.  Finally,  z*n  —  zj  for  n  <  m" on  T. 

PROOF:  Identical  to  that  of  Theorem  6  with  Corollaries  4  and  6  replacing  1  and  3. 

We  can  now  define  a  new  condition  to  assure  strict  results  in  Corollaries  1,  2,  3,  4,  S, 
and  6: 

(V)  Ift,+B-  1.  then  f-'(0)  >  0. 

In  all  the  results  concerning  the  base  inventory,  y*",  in  Corollaries  1 ,  2  and  3  and  the  base  net 
inventory,  z*"  in  Corollaries  4,  S  and  6,  the  last  inequality  sign  can  be  replaced  by  a  strict  one, 
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under  Condition  (V).  This  is  so  because,  if  either  rj+B  <  1  or  “  1  and  F„  '(0)  >  0, 
z*"  z*"+l  implies t, (z*n  -  U)  <  y*"+1. 

6.  FLUCTUATIONS  OVER  TIME  OF  THE  OPTIMAL  SOLUTION 
IN  THE  AGGREGATE  COSTS 

In  this  section,  we  link  the  fluctuations  of  the  optimal  solutions  with  the  variations  over 
time  of  one  aggregate  cost  while  the  other  two  remain  stationary.  Thus,  we  can  tell,  for  exam¬ 
ple,  when  the  advertising  costs  change  with  the  seasons,  what  should  be  the  pattern  over  time 
of  the  optimal  policies  without  any  computations.  Clearly,  without  computations,  we  cannot 
hope  to  find  the  exact  value  of  the  optimal  policies,  but  many  potential  candidates  can  be  dis¬ 
carded  at  a  glance.  Say,  optimal  advertising  expenditures  must  peak  during  the  cheapest  month 
for  advertising,  so  any  planned  expenditures  over  time  which  do  not  peak  at  that  time,  are  not 
optimal  and  should  not  be  implemented  with  that  timing. 

For  notational  simplicity,  let  y*(<o„)  denote  the  myopic  base  inventory  in  period  n  when  o> 
varies  from  one  period  to  next;  as  usual,  <*»  can  take  the  value  //,  P  or  S.  This  obviously 
applies  to  all  other  policies  as  well.  The  reader  should  note  that,  for  the  first  time,  stationarity 
over  time  of  certain  costs  is  required.  By  convention,  only  the  parameters  specifically  exhibited 
vary  over  time.  Also,  the  reader  should  be  aware  that  comparing  y*(tu")  and  y*(w"+l)  is  not 
that  simple  even  when  <on  is  comparable  to  <u„+1  (larger  or  smaller  by  the  incremental  order¬ 
ing),  because  we  are  comparing  ( <o„ ,  «in+|.  <u„+2.  •••)  with  (o>n+|,  ton+2,  ...),  not  (w„,  a>n+i, 
<on+2*  •  ■  •)  with  (&),',  o^'+i,  aV+2-  ••■)  as  was  done  in  Sections  3  and  4,  the  former  are  not  com¬ 
parable,  while  the  latter  were.  However,  by  Theorems  2,  3  and  4,  we  can  derive  results  on  the 
myopic  base  stocks  because  we  compare  the  period  one  by  one,  i.e.,  compare  a >„  to  a»n+i,  not 
at"  to  Since  all  other  parameters  are  stationary,  comparing  &>„  to  wn+1  is  equivalent  to 

comparing  w„  with  a^'  in  the  same  period.  Therefore,  if  ^  &>n+1,  and  y*(<nn)  is  increasing  in 
*>„,  then  y*{<o„)  is  increasing  in  a>„  and  y*(a>„)  <  y*(wn+|).  Expanding  on  this  theme,  let  o>„, 

n  «■  1 . N  be  such  that  <a„  is  comparable  to  «„+)  for  all  n.  It  follows  that  all  the  y*(o>„)  are 

comparable  with  the  same  pattern.  Note  that  to„  and  w„+2  are  not  necessarily  comparable 
because  possibly  u,  <  «in+i  and  wn+1  >  o>n+j.  Thus,  we  can  replace  the  conditions  on  the 
myopic  base  stocks  in  Theorems  6  and  7  by  a  condition  on  the  parameters  directly  and  the  con¬ 
clusions  of  these  theorems  still  hold. 

The  rest  of  this  section  is  concerned  with  establishing  the  relationship  over  time  of  the 
fluctuations  of  the  optimal  policies  and  of  the  base  stocks  with  the  variations  of  the  aggregate 
costs.  Corresponding  to  the  variations  of  each  of  these  aggregate  costs,  there  are  types  of 
fluctuations  for  the  optimal  policies  labeled  Type  P ,  H  and  S  in  the  obvious  fashion.  This  con¬ 
vention  will  prove  useful  when  primary  costs  are  examined  in  the  next  section. 

THEOREM  8:  (Type  P ’).  Let  Condition  (IV)  hold.  If  the  model  is  stationary,  except 
possibly  for  the  P„ ,  and  if  P„  and  P„+\  are  comparable  for  all  n,  the  myopic  policy  is  optimal  in 
every  period  for  all  intitial  inventories  x  ^  y*(Pt).  Moreover,  (y*(P "),  j8  *(P"), 
z*(/»»))-  (y*(Pn)yp*(p„),z*(Pn)). 

PROOF:  See  Appendix. 

If  x  ^  y*(P j),  the  optimal  policy  (yn(x).  Pn{x))  is  equal  to  the  base  myopic  policy 
O'*.  /3  *) .  It  says  that  when  advertising  costs  vary  over  time,  the  manager  should  react  as  if 
short-sighted,  i.e.,  pay  no  attention  to  the  parameters  of  future  periods.  Also,  both  advertising 
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expenditures  and  inventory  levels  are  countercyclical  to  variations  in  the  advertising  cost  func¬ 
tions,  with  perfectly  matching  cycles.  Noting  that  the  net  inventory  positions  are  procyclical 
indicates  that  the  induced  fluctuations  over  time  are  larger  for  the  advertising  expenditures  than 
for  the  inventory.  These  results  are  what  we  would  have  expected,  at  least  regarding  the  anti- 
cyclical  behavior  of  advertising  expenditures  and  inventory  positions.  The  results  of  this 
theorem  are  illustrated  in  Figure  2,  where  >*(/*"),  j3  •(/*"),  and  zMP")  are  plotted.  The  solid 
line  represents  y*(PR),  the  dashed-dotted  line  /3  *(/*")  and  the  dotted  line  z*(Pa).  The  reader 
should  not  forget  that  the  driving  force  behind  these  fluctuations  are  the  variations  of  Pn ,  which 
are  not  represented  on  the  graph— they  are  on  an  uncomparable  scale— and  which  are  perfectly 
anticyclical  to  y*(P„ )  -  y*(P").  For  continuity  of  the  representation  of  the  optimal  solutions, 
the  graphs  of  y*(P")  in  Figure  2  and  y *  in  Figure  1  are  identical. 


Time  or  period 


Figure  2.  Fluctuations  (Type  P)  of  the  base  stocks  with  respect  to  variations  in  P„ 
over  time,  whose  variations  are  perfectly  anticyclical  to  those  of  y’(P„). 

Next,  we  examine  the  fluctuations  (Type  H)  induced  by  the  variations  of  the  aggregate 
ordering-capacity  cost. 
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THEOREM  9:  (Type  H).  Let  Condition  (IV)  hold.  If  the  model  is  stationary,  except 
possibly  for  the  H„ ,  and  if  H„  in  $t*  is  unimodal  in  n  on  an  interval  T  with  least  mode  m,  then 
y*(Hn )  and  z*(Hn)  are  unimodal  in  n  on  T  with  respective  least  modes  m'  and  m",  such  that 
Moreover,  (. v‘(//"),  / 8*(,Hn),  z*(H"))  -  (y*(H„),  z'(H„))  for 

n  <  m  ",  and  0  *(//")  is  nondecreasing  in  n  <  m. 

PROOF:  See  Appendix. 

As  expected  from  the  discussion  following  Theorem  2,  the  base  levels  y*,  0  *  and  z*,  vary 
in  an  anticyclical  manner  to  the  variations  of  the  aggregate  ordering-capacity  cost  overtime. 
The  time  of  highest  cost  corresponds  to  the  time  of  lowest  base  levels,  but  the  base  inventory 
and  net  inventory  peak  before  the  moment  when  the  aggregate  cost  reaches  its  lowest  level. 
Since  inventory  is  not  necessarily  sold  immediately,  the  inventory  purchases  of  yesterday  may 
last  until  tomorrow  when  the  costs  start  rising  again;  so,  in  anticipation,  inventory  levels  are 
curtailed  while  the  costs  are  still  declining.  On  the  contrary,  advertising  expenditures  which 
generate  current  sales  do  not  peak  before  the  time  of  lowest  cost.  The  results  of  Theorem  9 
are  illustrated  in  Figure  3,  whence  the  convention  used  in  Figure  2  is  maintained;  in  addition 
y*(H„ )  is  pictured  by  a  dashed  line,  and  the  variations  of  Hn  not  pictured  in  Figure  3  are 
exactly  anticyclical  to  those  of  y*(Hn). 


Time  or  period 


Figure  J.  Fluctuations  (Type  H)  of  the  base  stocks  with  respect  to  the  variations  in  H„  over  lime, 
whose  variations  are  perfectly  anticyclical  to  those  of  .v*(W,). 
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Finally,  we  turn  our  attention  to  the  fluctuations  (Type  S)  induced  by  the  variations  over 
time  in  the  aggregate  shortage  holding  costs,  S„ . 

THEOREM  10:  (Type  S).  Let  Condition  (IV)  hold.  If  the  model  is  stationary,  except 
possibly  for  the  S„ ,  and  if  S„  in^*  is  unimodal  in  n  on  an  interval  T ,  then  y*(S"),  z*(S")  and 
P  *(S")  are  unimodal  in  n  on  T.  Also,  the  least  mode  m'  of  z*(S")  is  no  larger  than  that  of  S„. 
Moreover,  each  mode  of  z*(S")  is  no  larger  than  that  of  S„.  Moreover,  each  mode  of  z*(S")  is 
a  mode  ofy*(S")  and  -p  *($"),  and  (v*(S"),  p  ‘(S"),  z*(S"))-  0>*(S„),  p  *(S„),  z*(S„))  for 
n  <  m'. 

PROOF:  See  Appendix. 

Again,  as  expected  from  the  discussion  following  Theorem  4,  the  base  levels  of  y*,  — / 8  * 
and  z*  vary  in  an  anticyclical  manner  to  the  variations  of  the  aggregate  shortage  holding  cost 
over  time.  The  time  of  highest  cost  corresponds  to  the  moment  of  lowest  inventory  and  net 
inventory  positions  and  highest  advertising  expenditures.  For  the  same  reason  as  in  the  previ¬ 
ous  theorem,  the  inventory  peaks  before  the  time  of  lowest  cost,  in  anticipation  of  future 
higher  costs.  This  particular  type  will  be  discussed  further  in  the  next  section  in  conjunction 
with  variations  in  the  demand.  These  results  are  illustrated  in  Figure  4  where  the  conventions 
used  in  Figure  3  are  maintained. 


Figure  4.  Fluctuations  (Type  S)  of  the  base  stocks  with  respect  to  variations  in  S„  over  time,  whose 
variations  are  perfectly  anticyclical  to  those  of  v*(S„). 
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As  mentioned  earlier,  results  of  Theorems  8,  9  and  10  extend  directly  from  the  base 
inventory  levels  y'n  to  optimal  inventory  levels  y"(x)  because  v”(x)  =  y*n  V  x.  They  extend 
also  toB'Tx)  and  z"(x)  when  the  myopic  policies  are  optimal  using  Theorems  2,  3  and  4.  But, 
when  myopic  policies  are  not  optimal  (see  Theorems  9  and  10),  they  cannot  be  extended  to 
jS'Tx)  and  zB(x),  because  in  Lemma  3,  0  *  =  0  *"  does  not  imply  /3„(x)  =  /3"(x);  however,  the 
fluctuations  of  /3"(x)  and  z"(x)  are  unlikely  to  deviate  strongly  from  /3  *"  and  z*\  though  a 
counter  example  should  not  be  hard  to  construct. 

7.  FLUCTUATIONS  OVER  TIME  OF  THE  OPTIMAL  SOLUTION  IN  THE 
PRIMARY  COSTS  AND  IN  THE  BASIC  DEMAND 

In  this  section,  we  will  establish  that,  under  variations  of  either  the  primary  costs  or  the 
basic  demand,  the  fluctuations  of  the  optimal  solution  are  of  either  Type  H ,  P \  or  S,  as  defined 
by  Theorems  8,  9  and  10.  Since  Condition  (IV)  holds  in  every  theorem,  we  will  assume 
throughout  this  section  that  Condition  (IV)  holds. 

Also  for  clarity,  the  results,  which  are  obtained  directly  from  Tables  2  and  3,  will  be  sum¬ 
marized  in  Table  4.  The  first  column  of  the  table  contains  the  nonstationary  factor  in  each 
case;  once  more,  we  remind  the  reader  that  only  one  factor  in  each  case  is  nonstationary  over 
time.  The  second  column  indicates  the  ordering  used.  In  the  third  column,  a  minus  shows 
that  the  order  in  the  set  is  reversed,  and  a  plus  is  used  otherwise.  Note  that  in  Theorems  8,  9 
and  10  the  order  is  reversed.  In  the  fourth  column,  the  types  of  fluctuations  are  indicated  by 
either  H,  P  or  S,  as  defined  in  the  previous  section. 


TABLE  4  —  Fluctuations  of  the  Optimal  Policies  with  the  Primary  Costs  and  Demand 


Nonstationary  parameter 

Ordering 

Type 

Natural  (  +  ) 
Reversed  (— ) 

Fluctuation 

Type 

Advertising  cost  p(b~l) 

incremental 

- 

P 

Sales  price  r,  (tj_  -  1) 

real 

+ 

P 

Capacity  cost  h 

incremental 

- 

H 

Backlogged  demand  factor  n (r«Xc) 

real 

invariant 

Backlogged  demand  factor »?_,  (r  <  Ac) 

real 

+ 

S 

Backlogged  demand  factor^-,  (r  >  \c) 

real 

- 

S 

Inventory  preserving  factor  rj  + 

real 

4- 

S 

Discount  factor  A 

real 

+ 

S 

Holding  and  shortage  cost  s 

incremental 

- 

S 

Basic  demand  distribution  F 

stochastic 

+ 

S 

Most  of  the  results  in  Table  4  can  be  interpreted  along  the  lines  that  followed  Theorems 
8,  9  and  10  and  the  reader  is  referred  back  to  the  previous  section.  There  are,  however,  three 
cases  that  retain  our  attention:  variations  in  selling  price,  in  the  discount  rate  (cost  of  financing 
the  operation)  and  in  the  stochastic  demand.  The  strongest  results  are  obtained  for  the  varia¬ 
tions  in  selling  price  when  unsatisfied  customers  wait  till  their  order  is  serviced.  In  that  case, 
the  optimal  inventory  level  and  the  advertising  expenditures  move  in  perfect  harmony  with  the 
variations  in  price,  while  the  net  inventory  position  varies  in  the  opposite  direction  implying 
that  the  swings  of  the  advertising  expenditures  are  relatively  greater  than  those  of  the  inventory 
position.  When  the  demand  varies  over  time,  the  inventory  level  and  the  net  inventory 
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fluctuate  in  the  same  fashion  while  the  advertising  expenditures  are  countercyclical.  In  this 
case,  the  peaks  of  inventory  precede  those  of  the  demand;  as  sales  are  uncertain,  no  one  wants 
to  have  a  too  large  inventory  on  hand  during  the  decline  in  demand  following  the  peak.  The 
cautious  (and  correct)  approach  is  to  retrench  early.  There  is  no  need  for  this  cautious 

approach  when  prices  vary,  because  in  that  case  the  demand  is  stable,  and  the  objective  is  to 

sell  at  the  right  time  and,  at  no  time  is  it  better  than  when  the  prices  peak.  In  a  more  general 
and  realistic  model,  both  demand  and  prices  vary  simultaneously,  likely  in  harmony.  Our 
results  state  clearly  that  the  inventory  level  should  follow  that  pattern,  but  they  are  ambiguous 
regarding  advertising  expenditures.  Finally,  when  the  discount  rate  increases  or,  equivalently, 
when  the  interest  rate  declines,  the  financial  cost  of  holding  the  inventory  (imbedded  in  this 
model  as  the  discounted  value  of  the  terminal  inventory  position)  is  reduced  and  it  becomes 
cheaper  to  hold  larger  inventory.  As  expected,  advertising  expenditures  rise  as  the  interest  rate 
rises  to  help  liquidate  the  inventory  position  which  has  become  costlier. 

One  case  is  not  covered  in  this  table,  namely  the  variations  of  rover  time  when  tj_  <  1 . 
Let  r„  be  unimodal  in  n  on  T.  Thus,  under  Condition  (IV)  and  by  Table  2,  y*(r")  is  unimodal 
in  ft  on  T  with  least  mode  m  not  exceeding  the  least  mode  of  r„.  Also,  by  Theorem  6, 

P  *(/•"-')  =  li  *(r„_,)  P*(rn)  =  p  *(rn)  for  n  <  m.  In  addition,  /3‘(rm  i)  =  /3*(rm_,)  ^ 

P*(rm)  ^  P*(rm),  by  Lemma  4.  Moreover,  if  v'(r")  >  v'{r"+])  and  rn  ^  r„  +  1, 

z*(rn)  >  v(z*(r")  ~  >  y*(rn  +  ')  Z  z*(rn  +  1)  by  Lemma  3;  thus,  /3*<r")  = 

P  (z* (/•");  r„)  is  nonincreasing  in  :*(rn)  and  —  rn,  by  an  argument  associated  with  equation  (15). 
Summarizing  p  *(r")  <  P  *(r"  +  1)  when  r„  <  /•„+,.  So  we  increase  the  inventory  position  when 
the  sale  price  increases,  but  we  start  to  reduce  it  before  prices  peak.  On  the  other  hand,  as  long 
as  prices  rise,  advertising  expenditures  rise,  as  their  effect  results  in  an  immediate  sale. 

Obsolescence  Probabilities 

The  discount  factors  A„  with  0  <  A„  ^  1  can  be  interpretd  as  the  probability  that  the  pro¬ 
cess  will  continue  in  period  n  4-  1  given  that  the  process  is  active  in  period  n.  Thus,  1  -  X,  is 
the  obsolescence  probability  in  period  n  +  1  Pierskalla  [  1 01  established  that,  with  nondecreasing 
obsolescence  probabilities,  i.e.,  A„  >  A„  +  1  for  all  n  and  a  positive  ordering  cost,  the  base  stocks 
are  such  that  y*1  >  . . .  >  y*\  In  Table  4,  we  obtain  a  more  general  result  concerning  the 
relationship  between  the  obsolescence  probabilities  and  the  base  stocks.  For  example,  if  A  „  is 
unimodal  in  «,  then  y*n  is  unimodal  in  n  with  the  time  of  its  ieast  peak  not  exceeding  that  of 
A,.  If  A  „  is  nonincreasing  in  n,  we  obtain  Pierskalla's  result  that  the  same  is  so  of  >•*". 

8.  EXTENSIONS 

In  this  section,  we  generalize  the  model  in  two  separate  directions,  convex  ordering  cost 
for  the  inventory  and  common  time  lag  in  delivery  and  advertising. 

First,  if  we  replace  the  previous  inventory  ordering  cost  by  the  convex  ordering  cost 
c(z)  -  c(z)  +  cz  in  (2),  we  obtain: 

THEOREM  14:  If  17  is  linear,  the  assertions  of  Tables  1,  2  and  3  continue  to  hold  for  the 
model  of  this  section. 
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PROOF:  Upon  noticing  that  the  additional  term  c(y  —  x)  in  (2)  is  int  .eni  of  H ,  P 
and  S,  the  proofs  of  Theorems  2,  3  and  4  can  be  used  directly  since  tj  is  linear.  I  hus,  Tables  1 , 
2  and  3  hold  true  in  this  case,  completing  the  proof. 

Karlin  IS]  claimed  that,  with  convex  ordering  cost  and  no  advertising,  i.e.,  /3(x)  =  0,  the 
fluctuations  of  the  optimal  inventory  policy  can  be  linked  to  the  variations  of  the  demand  distri¬ 
bution  over  time.  Upon  reflection,  this  seems  unlikely  as  the  op\mal  policy  responds  not  only 
to  variations  in  demand  but  also  must  smooth  the  size  of  the  orders  from  one  period  to  the 
next,  due  to  the  added  constraint  of  convex  ordering  cost.  The  following  example  provides  a 
counterexample  to  an  analogue  of  Corollary  1  for  convex  ordering  cost  and  no  advertising  It 
will  show  that  even  if  the  myopic  policy  falls  and  rises,  the  optimal  policy  keeps  rising  to 
smooth  the  ordering  cost  overtime.  Without  such  an  analogue  corollary,  it  is  not  possible  to 
establish  a  theorem  analogue  to  Theorem  6,  and  therefore  t.ie  fluctuations  of  the  optimal  policy 
cannot  be  predicted  from  those  of  the  myopic  policy. 

EXAMPLE:  Let  r,  -  1,  H  =  0,  P  -  0,  c(z)  -  S(z)  «  zJ/2,  t/1  —  (10. 0. 100)  and 
N  ~  3.  The  myopic  policies  in  each  period  are  J|(x;  10)  ■*  ((10  +  x)/2)  V  x, 
y2(x,0)  -  (x/2)  V  x  and  v3(x;  100)  =  ((100  +  x)/2)  V  x.  The  optimal  policies  in  each  period 
are:  j'(x;  10. 0. 100)  -  ((5x  +  180)/13)  V  x,  J2(x;0, 100)  «  ((2x  +  100)/S)  V  x  and 

J^lxilOO)-  ((100  +  x)/2)  V  x.  Therefore,  when  x  «  0,  J1(0;10)-5>  J'2(0;0)  -  0  < 
,vj(0;100)-  50,  but  y'(0;10, 0,100)  -  13  11/13  <  y2(0;0, 100)  «  20  <  yJ(0;I00)«  50. 

Second,  if  a  common  interval  of  time,  v,  is  allowed  to  occur  between  the  moments  of 
product  ordering  and  of  advertising  purchase  and  the  moments  of  the  delivery  of  the  product 
and  of  the  advertising  effect,  all  the  results  of  the  previous  sections  hold,  provided  that 
t?+  -  tj_  -  1  and  (5)  is  replaced  by 

S(z )  =»  Es(z  -  V)  —  A  c'z, 

where  V  is  a  random  variable  whose  distribution  is  the  convolution  of  F„  ,  ....  F„+v.  In  partic¬ 
ular,  if  the  comparison  criteria  F„ C  Fn+l  or  F„  D  F„+i  is  replaced  by  F„  C  Fn+v  +  i  or 
F„  D  Fn+t,  + 1,  as  in  Veinott  [17),  the  results  of  Table  4  apply  to  the  fluctuations  of  the  basic 
demand  distribution. 

9.  CONCLUSIONS 

This  paper  is  an  example  of  the  use  of  dynamic  programming  in  a  nonstationary  environ¬ 
ment  where  properties  of  the  optimal  policies  were  derived  successfully  without  resorting  to 
numerical  computations  or  severe  limitations  on  the  cost  functions— such  as  restricting  them  to 
be  quadratic.  The  methodology  used  in  this  paper  can  be  applied  directly  to  discrete  time  con¬ 
trol  problems,  if  the  controls  and  nonstationary  parameters  of  interest  exhibit  some  subaddi¬ 
tivity  properties. 

To  come  back  to  the  specific  model  of  inventory-advertisement  treated  in  this  paper,  its 
most  peculiar  feature  is  the  fact  that  advertising  expenditures  should  take  place  during  periods 
of  low  demand  so  as  to  smooth  total  demand  over  time.  This  corresponds  well  to  our  notion  of 
a  sales  campaign  during  the  low  season  (cars  in  January,  appliances  around  the  same  time, 
winter  equipment  around  spring  time,  etc  ).  However,  it  does  not  capture  the  fact  that  often 
advertising  expenditures  peaked  just  prior  to  or  at  the  peak  of  the  season  (ice  tea  commercials 
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on  TV  during  the  summer,  etc  ).  A  more  complex  demand  function  is  needed  to  handle  this 
problem  and  possibly  the  simultaneous  presence  of  two  types  of  advertisement— procyclical  and 
countercyclical  Another  factor  that  is  neglected  in  this  paper  is  the  simultaneous  presence  of 
more  than  one  firm  in  the  market,  firms  which  compete  for  many  of  the  same  customers.  This 
gaming  factor  could  very  well  reverse  the  results  obtained  here,  but  I  would  not  like  to  venture 
a  guess  at  this  stage. 
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APPENDIX 


THEOREM  2: 

PROOF:  Since  S(z)  +  \£CMz  -  (/);  H"+>]  is  convex  in  z,  the  term  inside  the  braces 
in  (2)  is  subadditive  in  (y.0 .H„)  on  R  x  R+  x  jf;  by  examples  A7  and  A8  and  Lemma  1. 
Therefore,  by  Theorem  A2,  the  least  element,  (yU;  H"),P(x.  «")),  minimizing  the  right  hand 
side  of  (2)  over  the  sublattice  (y  >  x,P  >  0)  is  nondecreasing  in  H„  Letting  the  sub¬ 

lattice  be  (y  -  x,  /3  ^  0),  we  obtain  directly  that  fi(x,Hn)  is  independent  of  Hn. 

On  eliminating/?  by  making  the  change  of  variables,  z  **  y  —  /?,  recursion  (2)  becomes 

(6)  C(x,H")-  min  {//„(y)  +  P(y  -  z)  +  S(z)  +  A  £Cfo(z  -  UY,  A/"+1]>. 

y'&x.y'Zi 

We  now  show  inductively  that  C(x\H")  is  subadditive  in  Or.  H")  on  R  x  %”m.  Since 
COr, H*+l)  =  0,  the  result  holds  trivially  for  COr; HN+i).  By  the  induction  hypothesis,  exam¬ 
ple  A5,  and  the  fact  t>  is  nondecreasing,  \£C[»)(z  -  U)\ Hn+l]  is  subadditive  in  ( z,Hn+ ')  on 
R  x  Since  P  is  convex,  by  examples  A6  and  A8  and  Lemma  1,  the  term  inside  the 

braces  in  (14)  is  subadditive  in  (y.z.H n)  on  R2  x  By  Theorem  Al,  C(x,Hn)  is  subaddi¬ 
tive  in  ( x,H" )  on  R  x/T"*,  terminating  the  induction.  By  Theorem  A2,  the  2-lexicographicaly 
least  element,  (yOr;//"),  z(x;  //")),  is  nonincreasing  in  H"  on  ft". 

If  we  make  the  same  change  of  variables  used  above  but  instead  eliminate  y,  recursion  (2) 
becomes 

(7)  C(x\H")  -  min  {H„<0  -  (-z))  +  P(0)  +  S(-(-z)) 

+  X  £C{tj(_ (— z)—  (/);  W"+1]}. 
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The  term  inside  the  braces  in  (7)  is  subadditive  in  (-z.fi  .H”*1)  on  R  x  R+  x  Jti  for  i  >  n  by 
examples  A6  and  A8  because  H„  is  convex.  By  Theorem  A2,  the  least  element,  /3(x;//"+1) 
right  hand  side  of  (7)  is  nondecreasing  in  //"+l  on  $(".  Let  y  -  x,  so  the  right  hand  side  of  (2) 
becomes 

(8)  min  [H„(x)  +  Pit 3)  +  Six  - 1 3)  +  X£CM*  -fi  -  (/); //"+1]) 

0>O 

The  term  inside  the  braces  in  (8)  is  subadditive  in  (fi  ,H,)  on  /?+  xf '  for  i  >  n.  By  Theorem 
A2,  the  least  element,  fi (x;//n+1)  minimizing  (8)  over  i/3  >  0)  is  nondecreasing  in  W"+1  on 
completing  the  proof. 

The  proofs  of  Theorems  3  and  4  are  sketchy,  as  they  follow  the  line  of  argument  used  in 
Theorem  2.  For  detailed  proofs,  see  Balcer  12]. 

THEOREM  3. 

PROOF:  On  eliminating  /3  by  making  the  change  of  variables,  z  -  y  -  / 3,  recursion  (2) 
becomes 

(9)  C(x\P")  —  min  {//(y)  +  P„(y  -  z)  +  Siz)  +  X  £Cfo  (z  -  U)\ P"+l]). 

y  >  x.y  >  z 

By  induction  C(x,P ")  is  subadditive  in  ( x.Pj )  on  R  x^'ffor  /  —  n . N.  By  Theorem  A2 

and  equation  (9),  the  2-lexicographically  least  element,  (y(x,P").  z(x;P")),  minimizing  the 
right  hand  side  of  (9)  over  (y  >  x.  y  >  z),  is  nonincreasing  in  P"+ 1  on^"  By  Theorem  A2 
and  equation  (2),  the  least  element,  (y  (x;/>"),  /3  (x;  £")),  minimizing  the  right  hand  side  of  (2) 
over  {y  ^  x,  /3  >  0),  is  nonincreasing  in  P„  on g>n. 

If  we  make  the  same  change  of  variables  used  above  but  instead  eliminate  y,  the  right 
hand  srie  of  recursion  (2)  becomes 

(10)  min  {//(/3  —  (— z))  +  P„(fi)  +  S  < —  ( —  z ) )  +  X  £Cbi  (— (— z)—  (/);  />n+1]} 

By  Theorem  A2,  the  2-lexicographically  least  element,  (/3(x;/>n),  - z(x\Pn )),  minimizing  the 
term  inside  the  braces  in  (10)  is  nonincreasing  in  P„  and  nondecreasing  in  P"+l  onP". 

In  equation  (2),  let  y  -  x  so  its  right-hand  side  becomes 

(11)  min{//(x)  +  Pn0 3)  +  Six  -  /3)  +  X  £Cfo  (x  -  /3  -  (/);£"+,l) 

Thus,  by  Theorem  A2,  the  least  element,  4  ix;P"),  is  nondecreasing  in  />"+l  and  nonincreasing 
on  P„  on^".  This  completes  the  proof. 

THEOREM  4. 

PROOF:  First,  on  making  the  change  of  variables /3  -  y  -  z,  recursion  (2)  becomes 

(12)  C(x;S")  -  min  [Hiy)  +  Piy  -  z)  +  S„(z)  +  X  £Cfo  (z  -  U):  S"+,]| 

y*x.y>  i 

By  induction  that  C(x;S")  is  subadditive  in  (x,S")  on  R  x  By  Theorem  A2,  the  2- 
lexicographically  least  element,  (y(x;S"),  z(x;S")),  of  the  minimizing  set  M"(x)  is  nonincreas¬ 
ing  in  S "  on  <f". 
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If  instead  of  eliminating  /S,  we  eliminate  y  by  the  change  of  variables  y  -  (3  -  z,  recursion 
(2)  becomes 

(13)  C(x;S")  —  min  (//(/3  -  (-*))  +  £(/3)  +  S„(-(-z)) 

a-(~z»x.a>o 

+  A£C(r,(-(-r)  -  i/);S"+l]|. 

By  Theorem  A2,  the  2-lexicographically  least  element,  (/3(x;S" ),  -z(x,S")),  minimizing  the 
term  in  braces  in  the  right  hand  side  of  (13)  is  nonincreasing  in  Sn  on  <jf". 

If  we  set  y  -  x,  the  right  hand  side  of  (2)  becomes 

(14)  min  (//Or)  +  /><£)  +  S„(x  -  0)  +  A£C[tj(x  -  0  -  i/);S"+l]). 

By  Theorem  A2,  the  least  element,  4  Or; 5"),  minimizing  the  term  inside  the  braces  in  (14),  is 
nondecreasing  in  S"  on  </",  completing  the  proof. 

Letij(x)  -  rjOr)  V  x.  sorj(x)  -  (tj+  V  l)x+  -  (tj_  A  1  )jc~  . 

COROLLARY  1:  If  y\  >  y*+1,  then  v(y"(x))  >  y"+1(x)  for  all  x. 

PROOF:  Suppose  there  is  an  x  such  that  rj(y"(x))  <  y"+,(x).  We  must  have 
x  <  y *"  V  y*"+l  for  if  not,  tj(x)  <  x  <  rj(x)  which  is  a  contradiction.  Also,  x  >  y *"  A  y*"+l 
for  if  not,  Tj(z*"-  £„“  *(()))  <  t)(y*")  <  y*"+l.  Then  by  Lemmas  3  and  4,  ytn-  y*  > 
y J+i  >  y*"+l  >  ^ (y*")  >  y*\  a  contradiction.  Thus,  we  have  y**A  y»"+l  <  *  < 

y*"  Vy*"+I.  Now  if  y*"  <  y*"+1,  then  Tj(y*")  <  tj(x)  <  y*"+1,  so  by  the  previous  argument 
we  have  a  contradiction.  If  instead  y*"+l  <  y*n,  then  y*n  ^  fj  (y*n)  <  x  <  y*\  a  contradic¬ 
tion.  This  completes  the  proof. 

COROLLARY  2'.  If  either  y*  4  yj+,  and  y»"  >  y*n+\  or  yj  <  yj+1  and  y*"  ^  y*"+l, 
then  t,(z-+1  -  £-+*,  (0))  5s  y*"+2  and /j(y"+1(x))  ^  y"+2(x)  for  all  x. 

PROOF:  Assume  tj(z*"+i  —  £„~+|  (0))  <  y*"+2.  By  Lemmas  3  and  4,  y*"+l  —  yj+,  > 
yj  ^  y*"  >  yM+l,  so  equality  occurs  throughout.  This  contradicts  v*n  >  y*"+l  in  the  first  case 
and  y *  <  yj+|  in  the  second.  Suppose  there  is  an  x  such  that  ij(y"+,(x))  <  y"+2(x).  We 
must  have  x  <  y*"+1  V y'"+2  for  if  not,  rj(x)  <  x  4  tj(x),  a  contradiction.  Also, 
x  >  y*"+l  A  y*"+2,  for  if  not,  Tj(y*',+  1)  <  y*"+2,  a  contradiction.  Thus,  y*"+1  A 
y*"+2  <  X  <  y*"+1  Vy-+2.  Now  if  y"+l  <  y*"+2,  then  ij(y*"+l)  <  ij(x)  <  y‘"+2.  a  con¬ 
tradiction.  If  instead  y*"+l  >  y*"+2,  then  y*"+l  ^  rj(y*"+l)  <  x  <  y*"+1,  a  contradiction, 
completing  the  proof. 

THEOREM  8. 

PROOF:  If  P„  4  then  by  Theorem  3,  z*<£„)  4  z*(Pn+l).  Thus,  if  y'IPn)  5>  0  for 
all  n  and  tj  +  <  1,  then  t j(z*(P„)~  £,-'( 0))  z*(£„)+  4  z*(/»,+1)+  <  y*(/\+1).  If 

tj+-tj_-1,  then  v(z*(PH)~  F„'(0))  4  z*(P„)  4  zm(Pn+„  4  y*(P„+i).  Similarly,  if 
P*  >  P„*t<  then  by  Theorem  3,  y*(Pn)  4  y*(£„+ 1),  so  tj (z*(/>„)  -  £."'(0))  ^  tj (y*(P„))  4 
y*(P„)  4  y*(/*,+|)  by  Condition  (IV)  and  tj  nondecreasing.  Thus  in  all  cases, 
tj (z*(PK)-  £,"'(0))  <  y*(£»+i),  whence  by  a  result  of  Veinott  (18],  the  myopic  policy  is 
optimal  for  x  y*(P\).  Thus  (y,/3)  -  (y*(£„), /3  •(£„))  minimizes  the  term  in  braces  in  (2) 
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subject  to  /3  ^  0.  But  iy'iP"),  p  * (/*'’))  is  the  least  such  minimum  so  y*iP")  ^  y*(P„ (  and 
P  •(/»")  <  /3  *iP„).  And  by  Lemma  3,  p*iP")>  p'iP„),  so  /3  *(/>")- 0  *(/>,).  Also, 
ECb)iy  -  P  *iP")  -  U),Pn+x)  is  constant  for  y  <  y*(P„)  so  )  =  (>>* ( /*") ,  >3  •  { />") ) 
minimizes  Hiy)  +  P„ip)  +  S(y  -  / 3)  subject  to/3  >  0.  But  iy*iP„).p  *iP„))  is  the  least  such 
minimum,  so  y*(P„)  ^  y*(P")  and,  hence,  (.v*</>") . P  *(P"),  z* (/»"))  -  iy*(Pn). 
P  *(P„).  z*(P„)),  completing  the  proof. 

In  Theorem  9,  Jf*r  is  respectively  . where  T  —  [»,.  n2].  In 

Theorem  10,  tfT  is  defined  similarly. 

THEOREM  9. 

PROOF:  By  Theorem  2,  y*iH„ )  and  z*iH„)  are  unimodal  in  n  on  T  such  that  the  modes 
of  H„  on  3t,T  are  modes  of  y*iHn)  and  z*(,i„).  Thus,  from  Theorems  6  and  7,  y*(H")  and 
z*iH")  are  unimodal  in  n  on  T  such  that  m\  m"  <  m.  If  m'  <  m'\  then  z*(Hn)  <  z“iHn+') 
for  n  =  m"-  1  <  w,  so  by  Condition  (IV),  Lemma  3,  and  Lemma  5,  /3 *(//")  =  p *(//„). 
Thus,  by  Theorem  A2  and  Lemma  5,  p *(//")  =  p *(//„)  ^  0  *(//„+,)  <  p*(Hn+l).  But  since 
m'  <  n,  y*iH")  >  y*iHn+l),  so  z*iHn)  >  z*(W"+1)  which  is  a  contradiction.  Therefore, 
m  "  ^  m'. 

Now  by  Theorems  6  and  7,  (y •(//"), /3  *(//"),  z*(//"))  -  (>-•(//„).  /3  *(//„).  z*(//J)  for 
n  <  m".  Thus,  by  Theorem  2  and  Lemma  4,P*iH")  -  j8  •(//„)  ^  /3  *(//n+1)  ^  p  •(Hn+l)  for 
n  <  m".  It  remains  only  to  show  p  *(H")  is  nondecreasing  in  n  for  m"  ^  n  ^  m.  To  this  end 
consider 

(15)  Qiz\  H„)  =  min  l//n(y)  +  P(/3)). 

Since  by  Lemma  1,  H„iy)  is  subadditive  in  iy,Hn)  on  R  x  3t'*,  the  least  element  iy.p)  - 
iyiz,H„),  p(z;H„ ))  minimizing  the  term  inside  the  braces  in  (15)  over  the  indicated  sublattice 
is  nonincreasing  in  H„  on  J t„  by  Theorem  A2.  Replacing  y  by  P~i—z)  in  (15),  one  sees  from 
the  convexity  of  H„  and  example  A6  that  the  new  term  inside  the  braces  is  subadditive  in 
ip.—  z)  on  the  sublattice  {p  ^  0).  Thus  by  Theorem  A2,  fi iz\H„)  is  nonincreasing  in  z. 
Because  S  and  C"+l  in  recursion  (2)  are  functions  of  z  alone,  piz*iHn)\H„)  -  p *(//").  Thus, 
for  m"K  n  <  m,  z*(//n)  >  z*(//"+l)  and  H"  >  Hn+\  so  P*iH" )  <  /3‘(//"+l),  completing 
the  proof. 

THEOREM  10. 

PROOF:  By  Theorem  4  and  7,  z*iS")  is  unimodal  in  n  on  T  with  least  mode  not  exceed¬ 
ing  that  of  S„  on^*r.  Now  Hiy)  +  Piy  -  z)  is  subadditive  in  (y.z)  by  the  convexity  of  Pand 
example  A6.  Therefore,  by  Theorem  A2,  the  least  y  -  viz )  minimizing  Hiy)  +  Piy  -  z) 
subject  toy  ^  z  is  nondecreasing  in  z.  Thus,  y*iS")  —  y(z*(5'’))  is  nondecreasing  in  z*iS")  so 
y*(S")  is  unimodal  in  n  and  each  mode  of  z*(S")  is  a  mode  of  y*iS"). 

Now  Hiz  +  p)  +  Pip)  is  superadditive  in  (z,/3)  by  the  convexity  of  H  and  example  A6. 
Therefore,  by  Theorem  A2,  the  least  p  —  piz)  minimizing  Hiz  +  p)  +  Pip)  subject  to  p  >  0 
is  nonincreasing  in  z.  Hence,  p*iS")  -  piz*iSn))  is  nonincreasing  in  z*(S")  so  -p'iS")  is 
unimodal  in  n  and  each  mode  of  z*iS")  is  a  mode  of  -p*iSn),  completing  the  proof. 
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ABSTRACT 

This  paper  considers  a  single-machine  scheduling  problem  in  which  penali¬ 
ties  occur  when  a  job  is  completed  early  or  late.  The  objective  is  to  minimue 
the  total  penalty  subject  to  restrictive  assumptions  on  the  due  dates  and  penaity 
functions  for  jobs.  A  procedure  is  presented  for  finding  an  optimal  schedule 


INTRODUCTION 

For  the  most  part,  the  literature  of  scheduling  has  been  confined  to  problems  involving 
penalty  functions  which  are  nondecreasing  in  job  completion  times.  Conway,  Maxwell,  and 
Miller  ]2,  p.12]  refer  to  such  functions  as  regular  performance  criteria.  There  are,  however, 
many  applications  in  which  nonregular  criteria  are  appropriate.  Applications  of  nonregular 
measures  occur,  for  example,  in  file  organization  problems.  As  indicated  by  Merten  and  Muller 
[4],  minimizing  the  variance  of  retrieval  times  for  records  in  a  file  may  be  highly  desirable, 
especially  in  on-line  systems.  In  spite  of  the  importance  of  nonregular  performance  measures, 
very  little  analytical  work  has  been  done  in  this  area.  This  is  primarily  due  to  the  difficulty  of 
solving  this  type  of  problem.  The  problem  of  minimizing  completion  time  variance  has  been 
studied  by  Merton  and  Muller  [4],  Schrage  15]  and  Eilon  and  Chowdhury  (3],  Merton  and 
Muller  [4]  have  analyzed  the  relation  between  flowtime  and  waiting  time  variance.  They  do  not 
show  how  to  minimize  such  measures  but  do  show  that  if  some  schedule  S  minimizes  (maxim¬ 
izes)  one  of  the  measures,  then  the  antithetical  schedule  S'  minimizes  (maximizes)  the  other 
measure.  They  also  show  that  the  minimum  value  of  both  measures  is  the  same.  Schrage  [5] 
has  examined  scheduling  for  minimum  completion  time  variance  when  there  are  up  to  5  jobs  to 
be  scheduled.  Eilon  and  Chowdhury  [3]  have  extended  Schrage's  work  by  showing  that  for  a 
schedule  to  have  minimum  completion  time  variance  it  must  be  T-shaped.  Let  job  k  be  the  job 
with  the  smallest  processing  time  of  all  the  jobs  to  be  scheduled.  A  schedule  is  T-shaped  if  the 
jobs  placed  before  job  k  are  in  descending  order  of  processing  time  and  the  jobs  placed  after  job 
k  are  in  ascending  order  of  processing  time.  Eilon  and  Chowdhury  [3]  then  compare  various 
heuristics  for  minimizing  completion  time  variance. 

Apart  from  the  problem  of  minimizing  completion  time  variation  are  those  problems  in 
which  the  nonregular  measure  is  a  function  of  job  lateness.  Consider,  for  example,  a  job  shop 
which  produces  components  for  subsequent  assembly  into  finished  products.  The  due  dates  for 
components  are  based  on  the  assembly  schedule  of  the  end  products.  If  component  orders  are 
late  then  the  assembly  of  a  product  may  be  delayed.  The  negative  effect  could  be  loss  of 
assembly  efficiency  and  customer  good  will.  If  a  component  order  is  completed  early  it  must 
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wait  in  storage  until  the  production  date  of  the  product  for  which  it  is  needed.  The  negative 
effect  is  an  accumulation  of  component  inventory. 

Sidney  has  addressed  the  type  of  problem  noted  above  and  in  [6)  he  presents  an  algorithm 
for  minimizing  the  maximum  penalty  when  penalties  are  incurred  both  when  jobs  are  com¬ 
pleted  early  or  late.  In  this  paper  we  continue  the  work  of  Sidney  by  presenting  a  simple 
[0(n2)l  algorithm  for  minimizing  total  cost  when  costs  increase  linearly  as  a  job's  completion 
date  deviates  from  its  due  date. 

THE  PROBLEM 

Consider  a  single  machine  with  n  jobs  immediately  available  for  processing.  Associated 
with  each  job  /  is  its  required  processing  time  p,.  All  jobs  have  the  due  date  d  >  MS  where  MS 
is  the  makespan  of  the  job  set;  i.e., 

ms  =  jr  Pr 

i- 1 

The  objective  is  to  find  a  schedule  S  which  minimizes 
(1)  Z(S)  =  £  ais(0, -  d)/n, 

i~  l 

where  C,  represents  the  completion  time  for  job  i  and  abs  denotes  the  absolute  value  function. 
Equation  1  is  the  mean  absolute  lateness  (MAL)  resulting  from  schedule  S. 

DETERMINING  AN  OPTIMAL  SCHEDULE 

If  preemption  is  allowed,  the  problem  is  trivial.  In  that  case  an  optimal  solution  with 
MAL  -  0  can  always  be  obtained  by  processing  the  jobs  in  any  order  during  the  time  interval 
(d  -  MS,  d).  Every  job  is  interrupted  when  it  has  an  arbitrarily  small  amount  of  processing 
time  «  remaining.  After  the  initial  processing  of  all  jobs,  the  remaining  n  6  amount  of  work  is 
completed.  Since  e  can  be  made  arbitrarily  small,  the  completion  date  of  each  job  can  be  made 
to  converge  to  d,  that  is,  for  each  job  / 

lim  (a6s[C,  -</])=  0. 

«— o 

Thus  in  the  limit,  MAL  =  0. 

Suppose  that  preemption  is  not  allowed.  Then  let  B  represent  an  ordered  set  of  jobs  to  be 
scheduled  without  inserted  idle  time  such  that  the  last  job  in  B  is  completed  at  time  t  -  d.  Let 
A  represent  an  ordered  set  of  jobs  to  be  scheduled  without  inserted  idle  time  such  that  the  first 
job  in  A  starts  at  time  t  -  d.  The  following  algorithm  produces  an  optimal  schedule  5,  defined 
by  the  permutation  ( B .  A).  Let  U  denote  the  set  of  unscheduled  jobs.  The  symbol  <J>  denotes 
the  empty  set. 

procedure  SCHED: 

B  —  A  ~  4>; 
while  (U  <t>)  do 

remove  a  job  k  from  V  such  that  pk  -  maxi  a); 
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insert  job  k  into  the  last  position  in  B\ 
if  ( U  ^  <t>)  do 

remove  a  job  k  from  U  such  that  pk  —  max(/>,); 

I 

insert  job  k  into  the  first  position  in  A; 

end 

end 

S  *—  (S.  A)\ 
end  SCHED 

Figure  1  illustrates  how  the  procedure  works.  In  the  sample  problem  there  are  five  jobs  with 
processing  times  given.  The  due  date  for  each  job  is  t  -  39.  The  five  Gantt  charts  illustrate 
how  the  algorithm  progressively  assigns  jobs  to  the  two  sets  B  and  A. 

Problem  data: 

job  identification  :  Jl  12  J3  J4  J5 

processing  time  7  12  5  4  10 

due  date  =  39 
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PROOF  OF  THE  ALGORITHM 


To  prove  the  correctness  of  procedure  SCHED  first  observe  that  it  is  unnecessary  to  con¬ 
sider  schedules  that  have  idle  time  inserted  between  jobs  in  S.  To  show  this  is  true,  consider 
any  schedule  S  with  idle  time  inserted  either  before  or  after  d.  If  the  idle  period  occurs  before 
(after)  d,  remove  the  idle  time  by  moving  the  jobs  before  (after)  the  idle  period  forward  (back¬ 
ward)  in  the  schedule.  The  resulting  schedule  is  an  improvement  over  S  since  the  completion 
dates  of  all  jobs  affected  are  moved  closer  to  d.  This  process  can  be  repeated  until  all  inserted 
idle  time  is  removed  from  the  schedule. 


A  second  observation  is  that  schedules  which  have  a  job  begin  processing  before  d  and 
end  processing  after  d  need  not  be  considered.  To  show  this  let  S  be  such  a  schedule.  Let  k  be 
the  job  in  S'  which  is  in  process  at  time  d  (see  Figure  2). 


I—  pk(b)  —I—  pk(a)  —I 


Fioi-Rt  2. 


Let  pk(b)  be  the  amount  of  processing  time  completed  on  job  k  before  d  and  let  pk(a )  be  the 
amount  of  processing  time  completed  after  d.  Clearly,  either  |£|  <  M  I  or  |B|  >  Ml  (Ml 
denotes  the  cardinality  of  B).  First  suppose  \B\  <  Ml.  Then  all  jobs  can  be  shifted  to  an  ear¬ 
lier  completion  time  such  that  Ck  =  d.  The  change  in  Z  due  to  the  shift  is 

|Slp*(ai)  -  \A  \pk(a)  -  pk(a)  <  0. 

If  Ml  >  \A  I  all  jobs  can  be  shifted  to  a  later  completion  time  such  that  Ck  =»  d  +  pk.  The 
change  in  Z  due  to  this  shift  is 

\A\Pkib)  -  I B\pk(b)  +  Pk(b)  <  0. 

In  either  event  the  objective  function  cannot  increase  by  such  a  shift. 

Because  of  the  above  two  observations  and  because  preemption  is  not  allowed,  it  is 
sufficient  to  confine  the  search  for  an  optimal  schedule  to  the  set  of  permutation  schedules 
formed  by  the  concatenation  of  B  to  A.  By  specifying  such  a  permutation,  it  is  understood  that 
no  idle  time  shall  appear  between  jobs  and  that  the  last  job  in  B  will  be  completed  at  time 
/  =  d. 

Observe  that  the  following  properties  characterize  the  schedule  S  produced  by  SCHED: 

1.  The  jobs  in  Bare  sequenced  by  longest  processing  time  first  (LPT), 
the  jobs  in  A  are  sequenced  by  shortest  processing  time  first  (SPT); 

2.  If  n  is  even  I B  \  =  Ml, 

if  n  is  odd  | B I  —  Ml  +  I ; 

3.  There  is  a  one-to-one  mapping  of  the  jobs  in  A  into  the  jobs  in  B  such  that 
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k  €  A  —  j  €  B  -*■  pk  ^  p,. 


To  prove  that  SCHED  yields  optimal  solutions  we  must  show  that  the  above  properties 
are  both  necessary  and  sufficient  for  a  schedule  to  be  optimal.  We  begin  with  the  condition  of 
necessity  by  assuming  that  some  schedule  S*  with  these  properties  is  not  optimal.  Then  it  must 
be  that  another  schedule  S  ^  5*  is  optimal.  Note  that  because  there  may  be  jobs  with  identical 
processing  times  there  may  be  many  schedules  satisfying  properties  1,  2,  and  3  that  have  the 
same  value  for  Z.  5  is  none  of  these  sequences.  Then  for  S  ^  S*  it  must  be  that  at  least  one 
of  the  three  properties  is  not  met  for  S. 

First  consider  property  1.  Suppose  for  schedule  S  that  B  is  not  in  LPT  sequence  or  that  A 
is  not  in  SPT  sequence.  Assume  the  former  condition  is  true.  The  proof  of  the  latter  is  simi¬ 
lar.  If  B  is  not  in  LPT  sequence  then  there  exists  two  adjacent  jobs  j,  k  such  that  pj  <  pk  (See 
Figure  3). 


j  k 

B  d  A 

Figure  3. 

Consider  interchanging  jobs  j  and  k  to  form  S'.  Clearly,  such  an  exchange  does  not  affect  the 
penalty  for  any  job  other  than  jor  k.  The  change  in  Z  produced  by  such  an  exchange  is 

(2)  Z'  -Z  -  U  -  q)  -  (d  -  Cj)  +  (d-  C'k)  -  (d  -  C*>. 


The  first  two  terms  of  Equation  2  represent  the  change  in  penalty  for  job  j ;  the  second  two 
terms  are  the  change  in  penalty  for  job  k.  Substituting 

Cy'  -  Ck.  C'k-  Ck-  Pj.  and  q  -  Ck-  pk  yields 

Z'-Z-pj-  Pk  <  0. 

Thus,  Z(S')  <  Z(S),  contrary  to  the  assumption  that  S  is  optimal. 

Next,  assume  that  S  *  S*  because  property  2  is  not  met  for  S.  For  this  to  be  so,  it  must 
be  that  either  |f?|  <  \A  |  or  |fl!  >  \A  I  +  1. 

CASE  1:  |0|  <  |/4 1 

Let  k  be  the  smallest  job  in  A.  By  property  1,  k  must  be  the  first  job  in  A.  Form 
schedule  S'  by  removing  k  from  A  and  making  it  the  last  job  in  B.  The  resulting  change  in  Z  is 

(3)  Z'~  Z~Pk\B\  -pk\A\. 
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The  first  term  in  Equation  3  is  the  increase  in  penalty  for  the  jobs  originally  in  B.  The  second 
term  is  the  decrease  in  penalty  for  the  jobs  originally  in  A.  Without  loss  of  generality  assume 
|B|  -  U|  -  1.  Then 

2'  -  Z  -  -  Pk  <  0, 

which  is  contrary  to  the  assumption  that  S  is  optimal. 

CASE  2:  \B\  >  \A  +  1 

Let  k  be  the  smallest  job  in  B.  By  property  I ,  k  must  be  the  last  job  in  B.  Form  schedule 
S'  by  removing  k  from  B  and  making  it  the  first  job  in  A.  The  resulting  change  to  Z  is 

(4)  Z’- Z  =  pk\A\- pk(\B\~  l)  +  pk. 

The  first  term  in  Equation  4  is  the  increase  in  penalty  for  the  jobs  in  A.  The  second  term  is  the 
decrease  in  penalty  for  all  jobs  in  B  other  than  k.  The  third  term  represents  that  net  change  to 
job  k.  Since  \B\  >  \A  |  +  I  it  follows  that 

Z-  Z  4  0. 

Clearly,  if  Z'  —  Z  <  0,  then  S  cannot  be  optimal.  If  Z'  —  Z  =  0,  then  it  must  be  that 
|B|  —  \A  |  4-  2,  allowing  S  to  be  transformed  into  an  equivalent  schedule  satisfying  property  2. 


Finally,  consider  property  3.  Assume  that  S  ^  S*  because  no  mapping  satisfying  properly 
3  exists  for  S.  Note  that  if  S  is  optimal,  properties  1  and  2  must  hold.  Attempt  to  construct  a 
mapping  satisfying  property  3  by  the  following  procedure.  Start  with  the  largest  (last)  job  in  A 
and  match  it  to  the  largest  (first)  job  in  B.  Continue  with  the  second  largest  job  and  so  on  until 
the  next  job  k  in  A  to  be  matched  to  j  in  B  cannot  be  done  because  pk  >  pj.  The  attempt  to 
map  jobs  must  fail  in  this  fashion,  otherwise  there  would  be  a  mapping  contrary  to  assumption. 
Let  BJ  be  the  jobs  that  precede  j  and  let  AK  be  the  jobs  that  follow  k  in  S  (see  Figure  4). 
Clearly  \BJ  |  =  \AK\.  Now  form  S'  by  interchanging  jobs  j  and  k.  This  affects  the  penalty  of  j 
and  kand  the  jobs  in  fi/and  AK,  but  no  other  jobs. 


J 

I-*-  X  -*■ 

k 

-  y  -1  1 

BJ 

j 

k 

AK 

i 

B  d  A 


Figure  4. 


Let  j x  —  d  -  Cj  and  y  -  Ck  -  pk  -  d.  The  change  in  penalty  caused  by  forming  S'  is 
(5)  Z  -  Z  -  \BJ\(pk  -  Pj)  +  \AK\(Pj  -  pk)  +  (x  -  y  -  pk)  +  O'  +  P,  -  x). 
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The  first  term  of  Equation  5  is  the  change  in  penaJty  for  the  jobs  in  BJ.  The  second  term  is  the 
change  for  jobs  in  AK.  The  third  term  is  the  change  to  job  /r,  and  the  fourth  term  is  the  change 
to  job  j.  Equation  5  simplifies  to 

Z'  -  Z  =  Pj  -  pk  <  0, 

which  contradicts  the  assumption  that  S  is  optimal.  This  completes  the  arguments  for  the 
necessity  of  properties  1-3.  To  show  that  the  three  properties  are  sufficient  to  define  an  optimal 
schedule  assume  that  S'  exists  which  satisfies  the  three  properties  and 

(6)  Z(S')  >  Z(S) 

where  S  is  an  optimal  schedule.  If  S  is  optimal  then  it  too  must  satisfy  the  three  properties. 
Now 

Z(S')  >  Z(S)  •*>  S'  *  S 

but  S'  =  5  since  the  three  properties  are  sufficient  to  define  a  permutation  of  the  jobs.  This 
contradicts  (6)  and  completes  the  proof. 

SIMILARITY  TO  THE  2-MACHINE  F  PROBLEM 


At  first  glance ^rne  might  mistakenly  believe  that  the  problem  defined  here  can  be  reduced 
to  the  2-machine  F  problem  discussed  by  Baker  (1,  p.  118]  among  others  (e.g.,  Conway, 
Maxwell,  and  Miller  [2,  p.  74]).  By  referring  again  to  procedure  SCHED  and  Figure  1  we  can 
see  how  such  a  reduction  might  be  construed.  We  simply  let  B  and  A  represent  the  two 
machines  and  assume  that  the  jobs  assigned  to  these  two  machines  are  sequenced  in  SPT  order 
for  each  machine.  The  example  job  set  shown  in  Figure  1  would  then  be  sequenced  as  indi¬ 
cated  in  Figure  5. 


Machine  A 


Machine  B 


Figure  5. 


If  the  MAL  problem  presented  here  were  equivalent  to  the  2-machine  F  problem  then  any 
optimal  solution  to  one  would  also  be  optimal  for  the  other.  This  is  not  so  and  to  see  why  con¬ 
sider  the  example  problem  appearing  in  Baker  (1,  pp.  1 18-1 19]  and  given  below: 

job  j  1  2  3  4  5  6 

Pj  1  2  3  4  5  6 
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As  Baker  points  out,  ore  optimal  solution  to  the  F problem  is  (not  to  scale): 


Machine  1 

rn 

□j 

5 

Machine  2 

□j 

3 

6 

i 

Let  us  assume  a  due  date  of  d  —  100.  One  could  then  "interpret"  the  above  solution  as  either: 


(a)  1  6 

1 

3  i  2 

|  1 

n — 4 

- 1 

5 

□ 

95 

98 

d 

101 

105 

no 

or 

1 

(b)  □ r 

| 

4  1  1 

_  2  . 

1  3 

| 

6 

Z1 

95 

99 

1 

d 

102 

105 

in 

Case  (a)  yields  MAL  of  23/6. 
optimal  schedule: 

Case  (b)  yields  MAL 

_ 

=  24/6. 

Procedure  SCHED 

16 

I 

4.1  2 

i  r 

1  3 

r 

5 

□ 

94 

98 

101 

104 

109 

which  has  an  MAL  of  22/6.  Clearly  then  an  optimal  solution  to  the  F  problem  is  not  always  an 
optimal  solution  to  the  MAL  problem.  Therefore,  the  two  problems  are  different.  The  reason 
for  this  lies  in  the  fact  that  property  3  (given  above)  is  necessary  for  an  optimal  solution  to  the 
MAL  problem  but  not  necessary  for  the  F  problem. 

SUMMARY 

What  we  have  addressed  here  are  problems  in  which  penalties  for  jobs  are  incurred  both 
when  they  are  completed  early  or  late.  The  critical  condition  that  d  ^  MS  is  sufficient  to 
ensure  that  procedure  SCHED  does  not  require  jobs  to  be  processed  prior  to  t  —  0.  It  has  been 
shown  that  under  this  assumption  the  optimal  schedule  must  be  K-shaped.  Whereas  procedure 
SCHED  provides  optimal  solutions  under  the  noted  conditions,  future  research  must  be 
directed  toward  problems  in  which  d  is  not  restricted  and  to  the  case  of  multiple  due  dates. 
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ABSTRACT 

This  paper  presents  a  mathematical  model  that  yields  the  area  drained  by  a 
naturally-occurring  network  of  streams.  The  model  is  based  on  empirically 
derived  relationships  in  the  field  of  quantitative  geomorphology  and  an  assump¬ 
tion  concerning  the  probabilistic  nature  of  stream  system  formation  A  wide 
range  of  model  solutions  is  indicated,  and  the  model  is  validated  by  comparing 
the  results  to  statistics  from  actual  stream  systems. 


INTRODUCTION 

The  areas  drained  by  stream  systems  are  necessary  factors  in  the  solution  of  many  prob¬ 
lems  in  engineering  design  and  military  planning.  Securing  such  areas  using  topographic  maps, 
aerial  photographs,  or  actual  measurement  is  often  fraught  with  difficulties  related  to  such  fac¬ 
tors  as  time,  money,  and  access.  This  paper  presents  a  model  that  was  developed  to  satisfy 
such  data  requirements  analytically  through  the  quantification  of  a  few  simple  parameters  in  the 
geographic  areas  of  interest.  The  model  is  based  on  empirically  derived  relationships  in  the 
field  of  quantitative  geomorphology  and  assumptions  concerning  the  probabilistic  nature  of 
stream  system  formation.  Model  solutions  are  indicated  for  stream  systems  of  various  sizes  and 
model  validity  is  examined  using  statistics  from  actual  stream  systems. 

BACKGROUND 

The  study  of  the  formation  of  drainage  patterns  from  a  stochastic  viewpoint  had  an  early 
beginning  In  1802,  John  Playfair,  Professor  of  Natural  Philosophy,  University  of  Edinburgh, 
wrote  Illustrations  of  the  Huttonian  Theory  of  the  Earth.  The  following  passage  has  been  called 
Playfair’s  Law: 


'Every  river  appears  to  consist  of  a  main  trunk,  fed  from  a  variety  of 
branches,  each  running  in  a  valley  proportioned  to  its  size,  and  all  of 
them  together  forming  a  system  of  valleys,  communicating  with  one 
another,  and  having  such  a  nice  adjustment  of  their  declivities  that 
none  of  them  join  the  principal  valley  either  on  too  high  or  too  low  a 
level;  a  circumstance  which  would  be  infinitely  improbably  if  each  of 
these  valleys  were  not  the  work  of  the  stream  which  flows  in  it." 
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Playfair  obviously  recognized  the  systematic  aspects  of  a  natural  drainage  system;  however,  lit¬ 
tle  was  done  to  quantify  these  aspects  prior  to  the  work  of  R.  E.  Horton  in  1945  [3).  Horton’s 
bifurcation  and  stream  length  ratios  are  particularly  important  to  this  model  and  to  most  of  the 
work  that  has  been  accomplished  in  the  field  since  Horton’s  publication. 

In  recent  years,  probability  theory  has  been  used  in  an  attempt  to  explain  and  quantify 
various  aspects  of  naturally  occurring  drainage  networks.  Prominent  among  the  works  in  this 
area  have  been  those  by  Hack  [2],  Leopold  and  Langbein  [41  and  Schenck  [81.  The  general 
theory  basic  to  these  works  is  that  the  formation  of  drainage  networks  can  be  explained  by  the 
laws  of  probability  and  that,  free  of  constraints,  streams  tend  to  occur  in  a  random  fashion. 
Leopold  and  Langbein  viewed  the  laws  of  Horton  as  representing  the  most  probable  state  in  a 
stochastically  formed  network  and  used  a  random  walk  model  to  show  that  the  laws  of  Horton 
did,  in  fact,  hold.  Schenck  used  a  similar  model  to  substantiate  the  empirical  work  of  Gray  [1]. 

The  first  known  attempt  at  constructing  a  predictive  model  was  made  by  Mayer  [5).  The 
Mayer  model  was  an  attempt  to  use  the  work  of  the  several  previous  authors  in  the  field  to  con¬ 
struct  drainage  basin  relationships  that,  with  quantification  of  a  few  parameters,  could  be  used 
to  predict  the  numbers,  orders,  and  hydrologic  characteristics  of  streams  within  a  naturally 
occurring  drainage  network.  The  model  described  in  this  paper  has  some  of  the  characteristics 
of  the  Mayer  model. 

BASIC  CONCEPTS 

Basic  to  the  model  is  the  concept  of  a  drainage  basin.  Loosely  defined,  a  drainage  basin  is 
the  total  area  drained  by  a  stream  and  its  tributaries.  Every  drainage  basin  is  circumscribed  by  a 
drainage  divide  that  separates  the  precipitation  that  falls  into  that  basin  from  the  precipitation 
that  falls  into  contiguous  basins.  Drainage  basin  areas  of  the  same  order  are  nonoverlapping; 
and  the  total  area  drained  by  all  basins  exhausts  the  surface  of  the  earth's  landmass.  These 
concepts  have  long  been  used  in  the  study  of  stream  morphology  and  are  basic  to  the  relation¬ 
ships  described  in  this  paper. 

Also  basic  to  the  model  is  the  Horton  method  of  stream  ordering.  This  method  traces 
each  stream  to  its  drainage  divide  and  considers  that  each  stream  extends  from  its  mouth  to  pri¬ 
mary  headwater.  The  method  is  illustrated  in  Figure  1. 

BASIC  RELATIONSHIPS 

The  model  presented  in  this  paper  draws  heavily  on  four  empirically  established 
relationships— two  derived  by  Horton  [3],  one  derived  by  Schumm  [91,  and  one  derived  from 
works  by  Hack  [2].  Although  the  relationships  are  least-squares  approximations,  they  represent 
data  collected  over  wide  geographic  areas,  and  the  degrees  of  fit  were  generally  quite  good. 

The  Horton  relationships  may  be  stated  as  follows: 

(1) 

(2)  n ,  - 

where 


I,  -  mean  length  of  an  Ah-order  stream. 
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Figure  I.  Horton  definition  of  stream  order  (redrawn  from 
Shreve  HOI).  Order  indicated  by  number  near  upstream  end 
of  respective  streams.  Unnumbered  streams  are  first  order. 


n,  “  number  of  rth-order  streams  in  an  Mh-order  drainage  basin, 

a  —  stream  length  ratio,  a  constant  factor  derived  from  empirical  evidence  by  which 
the  mean  length  of  an  rth-order  stream  exceeds  the  mean  length  of  an  (/  -  l)st- 
order  stream, 

and 

fi  «  bifurcation  ratio,  a  constant  factor  derived  from  empirical  evidence  by  which  the 
number  of  (/'  -  Dst-order  streams  exceeds  the  number  of  rth-order  streams  in  an 
Mh-order  drainage  basin. 

In  examining  collections  of  data,  Horton  and  others  found  that  the  values  observed  for  a  and  0 
varied  somewhat  from  basin  to  basin,  depending  on  such  factors  as  gradient,  soil  composition, 
rainfall,  etc.,  but  that  the  values  recorded  in  mature  basins  tended  to  cluster  around  2.68  and  4, 
respectively.  Consequently,  these  values  are  referred  to  throughout  this  paper  as  the  equili¬ 
brium  values  for  these  basic  parameters. 

The  Schumm  relationship  may  be  stated  as  follows: 

(3)  A “  An 


! 
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where 

A,  «*  area  drained  by  al!  Ah-order  basins  in  an  Mh-order  parent  basin, 
and 

A  =  basin  area  ratio,  a  constant  factor  by  which  the  total  area  drained  by  all  basins  of 
Ah-order  exceeds  the  total  area  drained  by  all  basins  of  (/  -  Ust-order  in  an  Mh- 
order  drainage  basin. 

The  Hack  relationship  may  be  stated  as  follows: 

(4)  /.-/.a''1, 

where 


f,  -  area  drained  by  overland  flow  occurring  directly  into  an  Ah-order  stream, 

and  a  is  the  stream  length  ratio  defined  by  Horton.  Note  that  the  area  drained  by  overland  flow 
is  that  area  for  which  the  sheet  flow  is  insufficient  in  length  to  sustain  a  first-order  stream,  e  g., 
the  area  immediately  adjacent  to  the  banks  of  a  stream. 

Using  the  Hack  and  Schumm  relationships  together,  we  can  also  form  the  useful  relation¬ 
ship: 

(5)  Fn  =  (A  a/{i)N-'AN. 

where 


Fn  -  total  area  drained  by  overland  flow  occurring  directly  into  the  Mh-order  stream. 

THE  JUMPING  PROCESS 


A  phenomenon  of  particular  importance  in  describing  the  behavior  of  streams  is  the  pro¬ 
cess  of  "jumping,"  a  phenomenon  often  ignored  in  constructing  drainage  system  models. 
Specifically,  a  stream  of  order  i  is  said  to  jump  if  at  its  mouth  it  intersects  a  stream  of  order 
/  +  2  or  greater;  i.e.,  the  stream  does  not  bifurcate  with  the  next  higher  order  stream  but  jumps 
at  least  one  order  in  the  hierarchy.  To  describe  this  process  analytically,  the  following  assump¬ 
tion  is  made: 


ASSUMPTION:  Within  an  Mh  order  parent  basin,  streams  of  order  /  bifurcate  with 
streams  of  order  i  +  1  and  greater  in  direct  proportion  to  the  relative  total  stream  length  of  the 
recipient  streams.  Based  on  this  assumption,  the  proportion  of  streams  in  an  Mh-order  parent 
basin  that  jump  to  the  order  A7  stream  may  be  derived  as  follows: 


First,  the  number  of  Ah-order  streams  that  bifurcate  with  (/'  +  l)st-order  streams  may  be 
written  as 


N 

ttj.i+1  “  +  ^  I  I  ■ 

y-/+l 
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Dividing  this  quantity  by  n ,  and  taking  its  complement  yields  the  proportion  of  Ah-order 
streams  that  bifurcates  with  all  orders  greater  than  i  +  1  (i.e.,  the  proportion  that  jumps).  This 
proportion,  denoted  Pn  may  be  written  as 

Pi  -  1  -  /  L  n,h- 

/  y-i+l 

which,  using  equations  (1)  and  (2),  sums  to 

P i  -  a(j3,v~'_l  -  aN~‘-')  /  (PN~‘  -  a  v~0. 


Next,  of  the  Ah-order  streams  that  jump,  the  proportion  that  jumps  specifically  to  the 
parent  stream  of  order  N  >  /  +  2,  denoted  Pi  N,  may  be  written  as 

N 

P i.H  ~  %(v/  £  njh‘ 

j-,+2 


which,  using  equations  (1)  and  (2),  sums  to 


P,N  «  (p  -  a)l(pN~l~'‘ 


„  N-i- 1 


). 


Finally,  the  proportion  of  all  rth-order  streams  that  jumps  to  the  order  N  stream  may  be 
expressed  as 

(6)  P,Pl  N  =  a"-'-'  (p  -  a)/(pN-‘  -  a*- 0. 

Solutions  of  this  equation  for  parent  basins  of  orders  3  through  9  and  equilibrium  a,  p  values 
are  set  forth  in  Table  1. 


TABLE  1  —  Proportion  of  Streams  in  a  Drainage 
Basin  That  Jump  to  Parent  Stream 


Order  of 
Jumping 
Stream  (/) 

Order  of  Parent  Stream  (A) 

3 

4 

5 

6 

7 

8 

9 

1 

.401 

.212 

.124 

.032 

2 

0 

.212 

.124 

.049 

.032 

2 

0 

0 

.212 

.124 

4 

0 

0 

.212 

.124 

.077 

5 

0 

0 

.212 

.124 

6 

0 

0 

jliJI] 

7 

0 

0 

NOTE:  Values  were  computed  using  a  -  2.68  and  f}  -  4,  the  equilibrium  values. 


THE  FUNDAMENTAL  RELATIONSHIP 

In  examining  collections  of  field  data,  Hack  [2]  found  that  the  area  of  a  basin  could  be 
related  to  its  length  and  shape  by  the  power  function 

/,  -  kaf, 

where  l,  represents  the  mean  length  of  Ah-order  streams,  o,  the  mean  area  of  Ah-order  basins,  h 
a  parameter  indicating  the  shape  of  the  mean  basin,  and  k  a  constant  factor  by  which  the  length 
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of  the  /th-order  stream  exceeds  the  length  of  the  fth-order  basin.  Assuming  that  h  and  k  are 
constant  in  relatively  homogeneous  basins  of  adjacent  order,  we  can  write 

a  - 

which,  multiplying  both  sides  by  1//3,  reduces  to  the  fundamental  relationship 
(7)  0-A  aUh. 

It  can  be  shown  that  h  is  logically  bounded  by  0.5  ^  h  <  1.0.  However,  Hack  [2],  Mayer  [5], 
and  others  found  that  values  not  exceeding  0.6  were  typical  for  h  in  relatively  mature  basins. 

AN  ADMISSABLE  REGION  FOR  MODEL  PARAMETERS 


The  area  of  a  parent  basin,  denoted  Ay,  may  be  written  as  the  sum  of  three  components: 
the  area  drained  by  included  basins  of  order  N  -  1  that  bifurcate  directly  with  the  order  N 
stream;  the  area  drained  by  the  basins  of  orders  N  -  2  and  lower  that  jump  directly  to  the 
order  N  stream;  and  the  area  drained  by  overland  flow  occurring  directly  into  the  order  N 
stream.  Symbolically,  this  relationship  may  be  written  as 


—  An_  1  +  P/v-iP Af-2.N^N-2  +  '  ■  ■  +  P \P I  \A  I  +  Fs , 
which,  using  equations  (3),  (5),  and  (6),  becomes 


An  —  A/4/v  +  An 


0-a 

V-l 

r 

(A//3)-' 

a 

L 

)- 2 

a1  -  $  > 

+  Ay(ka/l 3)' 


and  further  reduces  to  the  basic  parametric  relationship 


(8) 


P~  a 

/v-l 

V 

(A//3)' 

a 

2. 

y-2 

a "2- 

+  A  +  (W0)v~'  -  1  -  0. 


This  equation  defines  the  equilibrium  behavior  of  A,  the  basin  area  ratio,  within  a  relatively 
homogeneous  drainage  basin.  The  bounds  on  A  are  determined  by  the  bounds  on  P„  the  pro¬ 
portion  of  streams  that  jump.  Hence,  A  assumes  its  minimum  value  when  P,  is  maximum,  i.e., 
P,  -  1,  all  /  <  N ,  and  A  assumes  its  maximum  value  when  P,  -  0,  all  i  <  N.  With  P,  =  0,  all 
i  <  /V,  we  obtain 

(9)  A  4-  (Aa/p)*"1  -  1  -  0, 

which  defines  the  upper  bound  on  the  admissable  region  for  the  a,  /3  pairs.  With  P,  -  1,  all 
i  <  N,  we  obtain 


(10) 


P 


p-«  V 

(A//3)2 

«  5 

al-2  -  p\~j 

+  A  +  i\a/p)N-'  -  1  -  0. 


which  defines  the  lower  bound  on  the  admissable  region  for  the  a,  ^  pairs.  Using  equation  (7) 
to  define  the  interdependence  of  the  parameters,  we  can  further  write  equations  (8),  (9)  and 
(10)  as  follows: 


(11) 

(12) 


(3-a 

a-J,h 

«  2- 

"  2-2 

a~j-  p-J 

+  pa-'/k  +  aWMh-iHN-l)  _  |  .  0. 
fia- l/A  +  _  g 
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(13) 


0 


p-« 

N- 1 

r 

a 

L 

J-2 

4-  $a~Vh  +  o'l/AM/i-IH.v-l) 


1  -  0. 


Solutions  of  equations  (11),  (12)  and  (13)  yield  the  admissable  region  for  the  a,  (3  pairs  for 
each  parent  basin  and  specified  value  for  6,  the  basin  shape  parameter.  The  equations  may  be 
solved  using  such  techniques  as  the  Newton-Raphson  method  of  successive  approximation  [7], 
Solutions  for  N  -  9  and  h  —  0.55  are  illustrated  in  Figure  2. 


Figure  2.  Admissable  region  for  a.  p  pairs  dhe  middle  curve 
is  ibe  locus  of  equilibrium  values) 


BASIN  AREA  MODEL 

As  illustrated  in  the  previous  section,  the  area  drained  by  a  parent  basin  may  be  parti¬ 
tioned  into  three  components.  The  recursive  nature  of  this  relationship  is  developed  in  Table  2 
for  parent  basins  of  successively  higher  order.  The  development  in  Table  2  suggests  that  the 
area  of  an  /Vth-order,  relatively  homogeneous  basin  may  be  expressed  in  recursive  form  as  fol¬ 
lows: 

N  N-l  N 

04)  ^-10*-'/,+  E  £ 

/-I  r-l  j-i+2 

Using  equation  (4),  the  Hack  relationship  for  overland  flow  areas,  the  first  term  of  equation 
(14)  may  be  summed  mathematically  and  expressed  as: 
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TABLE  2  —  Recursive  Relationship  of  Included  Basin  Areas 


Parent 

Area  of  Parent  Basin  by  Component 

Basin 

Order 

Orthodox  Basins 

Jumping 

Overland 

Total  Area  Drained 

(/> 

<4,  |> 

Basins 

Flow 

t.4,l 

<V 

«/a» 

- 1 

1 

0 

0 

'i 

A 

2 

HA 

0 

/: 

Hi  i  + 

3 

H2A  +  HA 

*^1.3 

t\ 

tJ 2 / ,  +  lil2  +  1,  +  J,  J 

4 

H' '/,  +  H2A  +  HA  +  HJ  i.j 

4 4  +  A.i 

A 

J»Vi  +  0Vj  +  0/i  +  /a  +  i  +  A  4  +  4 

,V 

V  I  V  \  V  | 

!>'  '/,+ 1  I  hs  'Ju 

tm i  t-\ 

N  -2 

I  As 
<-  1 

A 

Zt8'  7,  +  £  Z  0'  4, 

/-i  /-I  i-.+: 

(15)  £/3v  '/;  =  ,/|(/3v-  avV(/3  -  «). 

It  may  be  observed  that  equation  (15)  is  the  fundamental  Hack  equation  for  an  .Vth-order  basin 
area  [2].  Hack  did  not  consider  the  process  of  jumping  in  his  development  which  made  his  area 
equation  deficient  by  the  jumping  component.  The  second  term  of  equation  (14)  may  be 
expanded  by  considering  the  following  development. 

From  the  jumping  process  described  in  a  previous  section,  the  total  area  drained  by  a 
stream  of  order  A  ^  1  that  jumps  to  a  parent  stream  of  order  /  ^  A  +  2  may  be  written  as 

A.  I  =  P 

That  is,  in  any  parent  basin  of  order  /  there  are  n  streams  of  order  A,  each  of  which  has  area  Ak 
and  a  proportion  equal  to  PkPkj  that  bifurcates  directly  with  (jumps  to)  the  order  /  stream. 
Therefore,  Jk  t  is  the  total  area  drained  by  those  Ath-order  streams  that  bifurcate  directly  with 
the  parent  (order  /)  stream.  Using  equation  (2)  for  nk  and  equations  (14)  and  (15)  for  Ak ,  this 
relationship  may  be  written  in  recursive  form  as 


(16)  PkPk.,(*,k 

where  A  ^  1  and  /  >  A  +  2. 


k-2  k. 

,/j(/3*  -  ak)/(p  -  a)  4-  L  Z  0 *  7,, 

/-I  y-/+2 


k  >  2 


Thus,  the  total  area  drained  by  an  Mh-order  basin  may  be  written  as 

<<7)  As  =  f\ (/3V  -  -  a)  +  £  L  P"~'Af 

/-*+ 2 
N  >  2 
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where  Jk  l  is  given  by  equation  (16).  Solutions  of  equation  (17)  for  a  range  of  a.  0  values  are 
set  forth  in  Table  3.  Table  3  also  reflects  the  implied  values  for  A,  the  basin  area  ratio,  for 
adjacent  equilibrium  basins. 


TABLE  3  —  Areas  Drained  by  Parent  Basins  for  a  Range  of  Parameter  >  s 


Parent 

Stream 

Order 

(A) 

o - 1 0% 

«  +  10% 

Equilibrium 

Values 

0+10% 

0-10% 

Basin  Area 
Ratio  for 
Equilibrium 
Values  (A) 

c*  =  2.41 

0  =  4 

a  =2.95 

0  =  4 

a=2.68 

0  =  4 

a=2.68 

0  =  4.4 

o  =  2.68 

0  =  3.6 

1 

i.o 

1.0 

I.O 

1.0 

1.0 

— 

2 

6.4 

7.0 

6.7 

7.1 

6.3 

3 

37.5 

43.3 

40.3 

45.7 

35.3 

4 

214.2 

261.3 

237.0 

288.0 

192.4 

5 

1217.4 

1559.3 

1380.7 

1804.9 

1035.7 

6 

7048.6 

9459.5 

8189.7 

11512.1 

5673.6 

■ 

7 

42266.4 

59456.1 

50340.5 

75911  7 

32271.8 

8 

263222.9 

388819.2 

321759.2 

518871.1 

191464.6 

■ 

9 

1696276.2 

2635843.5 

2130301.0 

3663590.5 

1180099.3 

NOTE:  Area  values  are  in  terms  of/,,  the  mean  area  drained  by  lst-order  streams,  expressed  in  square  miles 


The  a,  0  values  reflected  in  Table  3  were  chosen  to  examine  ihe  sensitivity  of  the  area 
model  to  these  basic  parameters.  The  values  all  lie  within  the  admissable  region  depicted  in 
Figure  2.  The  results  show  that  the  model  is  very  sensitive  to  both  a  and  0,  but  is  most  sensi¬ 
tive  to  0.  For  example,  for  A  =  9,  a  10  percent  increase  in  0  causes  a  72  percent  increase  in 
basin  area,  whereas  a  10  percent  increase  in  a  causes  only  a  24  percent  increase  in  basin  area. 
Sensitivity  to  both  parameters  also  tends  to  increase  as  A  increases.  The  sensitivity  results  sug¬ 
gest  the  importance  of  quantifying  the  model  parameters  in  the  area  of  interest,  especially  if 
there  is  reason  to  suspect  the  presence  of  nonequilibrium  behavior. 

The  values  for  the  basin  area  ratio,  A,  in  Table  3  were  computed  using  equation  (3).  This 
equation  may  be  written  as  A-4V  =  <4.v-g  where  -4V-i  is  defined  to  be  the  total  area  drained  by 
all  (A  —  l)st-order  streams  in  an  Ath-order  parent  basin.  However,  the  areas  given  in  Table  3 
are  for  a  single  basin  of  each  order.  Therefore,  in  terms  of  the  values  in  Table  3  we  can  write 
AA;v  =  0/f*-,,  which  implies  that  A  differs  from  unity  by  the  sum  of  two  areas:  the  area 
drained  by  overland  flow  into  the  Mh-order  steam  and  the  area  drained  by  those  streams  of 
orders  A '  —  2  and  lower  that  jump  to  the  Mh-order  stream.  Thus,  A  can  be  interpreted  as  the 
percent  of  the  area  drained  by  an  Mh-order  stream  that  is  due  to  included  basins  of  (A  -  1  lst- 
order.  The  balance  (I  -  A)  is  the  proportion  that  is  drained  by  overland  flow  into  the  Mh- 
order  stream  and  the  streams  of  order  A  -  2  and  lower  that  jump  directly  to  the  Mh-order 
stream  and  are  not  included  in  (A  -  Dst-order  basins. 

MODEL  VALIDATION 

The  acid  test  of  a  model  is  the  extent  to  which  the  model  output  correlates  with  observed 
data.  In  this  case,  comparison  data  are  available  for  several  of  the  major  stream  systems  of  the 
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world.  (See,  for  example,  Horton  13]  and  Morisawa  [6]).  Table  4  lists  relevant  data  for  four  of 
the  world’s  major  river  systems  and  one  smaller  tributary  of  the  Delaware  River. 


TABLE  4  —  Comparison  of  Model  Results  with  Actual  River  Systems 


River 

System 

River 

Length 

(Actual) 

(miles) 

Area 
Drained 
(Actual) 
(sq  miles) 

Computed 

Order* 

(A3 

Area 

Drained 

(Model) 

(sq  miles) 

Implied  Mean  Area 
Drained  by  First 
Order  Streams* 

(sq  miles) 

(/.) 

Delaware, 
East  Branch 

5/ 

785 

5 

1,381  /, 

.57 

Tennessee 

900 

40,600 

7 

50,341  /, 

.81 

Arkansas 

1450 

160,500 

8 

321,759  /, 

.50 

Mississippi 

3892 

1,243,700 

9 

2,130,301  /, 

.58 

Nile 

4160 

1,150,000 

9 

2,130,301  /, 

.54 

•See  text  for  discussion. 


The  orders  of  the  parent  streams  in  Table  4  were  computed  by  setting  /'  —  N  in  equation 
(1)  and  writing  the  equation  as  follows: 

N  -  log(V/,)/loga  +  1. 

The  values  for  N  were  then  computed  using  a  -  2.68,  its  equilibrium  value,  and  lx  -  1 .  While 
/i  tends  to  vary  between  about  0.5  and  1.5  miles  depending  on  how  well  an  area  drains,  the 
values  tend  to  cluster  about  unity  in  mature  basins.  (See,  for  example.  Hack  [2]  and  Horton 
[3].)  It  may  also  be  observed  that  both  the  areas  drained  and  the  lengths  of  the  parent  streams 
tend  to  vary  somewhat  within  basin  order  since  the  basin  orders  are  stated  as  integers  and  tend 
to  increase  in  proportion  to  the  logarithm  of  the  areas  drained. 

A  measure  of  the  consistency  with  which  the  model  estimates  the  actual  areas  is  given  in 
Table  4  by  the  implied  mean  values  for  the  areas  drained  by  first-order  streams,  /j.  Examina¬ 
tions  of  field  data  suggest  that  this  value  varies  in  nature  between  about  0.1  and  1.0  square 
miles,  but  that  values  of  0.5  to  0.8  are  typical  for  mature  basins.  (See,  for  example.  Hack  [2], 
Horton  [3],  and  Morisawa  [6].)  Thus,  the  model,  solved  for  equilibrium  parameter  values, 
appears  to  produce  very  good  approximations  of  actual  basin  area  size.  Better  estimates  could 
be  expected  if  the  parameters  were  quantified  in  the  specific  areas  of  interest;  however,  the 
effort  required  to  obtain  such  estimates  is  often  substantial. 

A  final  measure  that  tends  to  validate  the  model  is  given  by  the  X  values  in  Table  3.  This 
can  be  seen  by  examining  the  relationship  implied  by  the  model  results  between  (i  and  the 
basin  shape  parameter,  h.  Since  the  area  model,  equation  (17),  is  not  a  function  of  h,  the  X 
values  in  Table  3  were  generated  independent  of  basin  shape  and  thus  could  be  considered  to 
represent  the  equilibrium  condition  with  respect  to  h.  Equation  (7),  which  relates  X  to  h,  can 
be  written  as: 

h  -  —  loga/log(X//3). 

By  substituting  the  high  and  low  X  values  from  Table  3  into  this  equation  and  using  equilibrium 
a  and  0  values,  we  obtain  an  implied  range  for  h  of  0.52  to  0.56  which  correlates  closely  with 
values  observed  by  Hack  [2],  Mayer  [5],  and  others  for  the  basin  shape  parameter;  i.e.,  a  basin 


NAVAL  RESEARCH  LOGISTICS  QUARTERLY 


VOL.  28,  NO.  4,  DECEMBER  1981 


MODEL  EOR  GENERATING  A  DRAINAGE  BASIN 


663 


that  is  slightly  elongated  in  the  downstream  direction.  It  also  correlates  closely  with  the  results 
plotted  in  Figure  2  which  show  that  the  equlibrium  solution  is  attained  when  a  =  2. 68,  0  =  4 
and  h  =  0.55,  and  tends  to  validate  the  assumption  made  in  deriving  equation  (7)  that  h  is  rela¬ 
tively  constant  in  basins  of  adjacent  order.  Therefore,  the  model  exhibits  the  desired  balance 
and,  since  the  results  correspond  closely  to  observed  data,  also  tends  to  validate  the  underlying 
relationships  empirically  derived  by  Horton,  Hack,  and  Schumm. 

SUMMARY 

The  model  described  in  this  paper  proved  quite  useful  in  estimating  stream  crossing 
requirements  for  military  operations.  Together  with  a  companion  model,  the  area  equation  was 
used  to  estimate  the  numbers,  sizes,  and  interfluvial  distances  of  streams  in  areas  of  interest. 
Although  the  mode)  behaves  quite  well  using  equilibrium  values  for  the  model  parameters, 
model  behavior  is  enhanced  by  quantifying  the  parameters  in  the  specific  areas  of  interest. 
Recommendations  to  overprint  these  data  on  standard  topographic  maps  were  an  outgrowth  of 
this  modeling  effort. 
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ABSTRACT 

This  paper  deals  with  a  flow-shop  problem  where  the  n  jobs  are  being  pro¬ 
cessed  uninterrupted  by  m  machines.  A  comprehensive  theory  based  on  "an 
earliest  starting  time  of  a  job"  concept  produced  the  most  efficient  solution 
method  for  a  variety  of  optimization  criteria.  The  paper  also  rectifies  several 
known  results  in  this  area. 


1.  INTRODUCTION 

Consider  a  flow  shop  problem  (later  called  UFP)  where  n  jobs  are  being  processed  without 
interruptions  by  m  machines  in  the  same  technological  order.  This  model  was  formulated  and 
solved  in  1972  by  S.  S.  Redd i  and  C.  V.  Ramamoorthy  [4]  and  by  D.  A.  Wismer  [6],  J.  N.  D. 
Gupta  [1]  generalized  the  method  of  [4]  in  1976  providing  a  new  theoretical  derivation.  The 
theory  of  [6]  and  [1],  however,  is  more  complicated  than  that  of  the  original  flowshop  problem. 
This  paper  offers  another  framework  utilizing  the  "earliest  time  when  a  job  starts  being  pro¬ 
cessed"  concept  which  makes  the  theory  of  the  UFP  simple.  It  shows  that  the  approach  based 
on  the  difference  of  earliest  starting  times  of  two  consecutive  jobs  (see  formula  (5))  leads  to 
the  most  efficient  method  for  a  variety  of  objective  functions.  The  paper  also  rectifies  several 
results  and  conclusions  of  the  existing  literature. 

2.  THE  MODEL 

We  will  adopt  the  following  notations: 

M,  —  s-th  machine,  s  -  1,2,  ....  m, 

r  —  job  r,  r  •=  1,2 . n, 

trs  —  processing  time  of  job  ron  Ms,  trs  >  0, 

T(P)  —  completion  time  of  permutation  P  =  p\,p2,  ....  p„  of  numbers  1,2,  ....  n, 
tf  —  earliest  time  when  Ms  starts  processing  job  p,  of  sequence  P. 

For  simplicity  assume  t}  =  /,  and  t\  -  0.  Then  for  each  k  =  1,2 . m 

(1)  I  V 

j-i 
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Since  >  0,  one  can  prove  the  following  theorem  of  [  1 1 . 

THEOREM  1:  For  a  schedule  to  be  feasible  it  is  necessary  that  all  machines  process  the 
jobs  in  the  same  order. 

PROOF:  Suppose  machines  Mk  and  Mk+i  do  not  process  the  jobs  in  the  same  order.  One 
can  assume  that  job  a  follows  immediately  job  b  on  A/*+l,  while  a  precedes  b  on  Mk.  Since  b  is 
being  operated  without  interruptions  the  processing  of  a  will  stop  for  at  least  ibk  +  tbk+i  >  0 
which  violates  the  feasibility  of  the  schedule,  Qed. 

Theorem  1  of  [1]  where  r„  >  0  is  not  true ,  however,  as  the  authors  of  [3)  have  shown  (by 

a  counterexample).*  According  to  (1)  and  (2)  the  //,  and  T(P)  can  be  calculated  given  the 

appropriate  Define 

(3)  d(pt,  a+i)  =  f,+i  -  t,. 

Then 

(4)  tj  =  /,  +  £  dip,,  pl+l)  =  £  d(p„  pi+l). 

/-I  i-i 

To  find  d(pi,  p;+i)  we  need  the  following  theorem.  Consider  an  arbitrary  sequence  P 
where  a  =  ph  b  =  p,+1.  Assume  that  the  processing  of  job  a  starts  at  the  earliest  time  . 

THEOREM  2: 


d(a,b)  =  max  jr  tas  -  £  ks 

I  r_  I 


PROOF:  The  problem  of  jobs  a  and  b  being  processed  uninterrupted  on  machines  A/,, 

M2 . Mm  can  be  viewed  as  a  classical  two-machine  flow-shop  model  where  "machines"  a 

and  b  handle  sequence  ....  Mm  of  "items"  Ms  in  the  same  order  a,b  and  where  pro¬ 

cessing  on  the  first  "machine"  a  goes  on  uninterrupted.  According  to  [2]  the  minimum  waiting 
time  of  "machine"  b  that  operates  sequence  M\,Mi,  ,  Mm  is  equal  to  the  right  hand  side  of 
(5).  The  processing  on  b  will  go  on  uninterrupted  once  b  starts  operating  at  t,  +  d(a,b)  which 
is  its  earlier  starting  time  tl+]  once  d(a,b )  is  minimum. 

Observe  that  (5)  holds  for  every  sequence  where  job  b  immediately  follows  job  a. 


•The  proof  of  this  theorem  (see  111,  p.  236)  is  deficient  due  to  the  following  gaps:  a)  it  overlooked  the  fact  that  the 
waiting  time  for  job  a  may  be  in  zero  whenever  -  0  even  though  rb„2  >  0  (and  not  >  r6m|  +  t as  slated  111. 
and  b)  different  processing  orders  of  mt  and  mz  (say  12345  and  51234)  do  not  necessarily  assure  an  existence  of  a 
schedule  where  a  immediately  follows  b  on  m2  and  b  immediately  follows  a  on  m  | 
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According  to  (2)  and  (4), 

T(P)  =  r„  +  £  iPn!  -  I  d(p„  p,+1)  +  £  t,mS. 

S-l  /—  1  5-1 

Hence,  the  UFP  is  equivalent  to  an  n  +  1-city  traveling  salesman  problem  (TSP)  where  the  dis¬ 
tance  cost  matrix  C  -  {caJ,  a.b  -  0, 1 . n  is  defined  by 


c0b  ~ 

cab  -  <f(a.6) 

V  i  *  0, 

V  a,f>  ^  0,  a  *  6, 

m 

^aO  *  (m> 

5-1 

V  a  *  0, 

r  ■*  oo 
'*aa  * 

V  a. 

P-  P1.P2 . Pn  is  an 

optimal  solution  of  the  UFP  if  and  only  if  0,pi,p2>  . . 

•  .  Pn,  0  is 

an  optimal  tour  of  the  TSP. 


3.  IDLE  TIME  APPROACHES 

Let  lk(a,b )  be  the  idle  time  between  two  consecutive  jobs  a  =  p,  and  b  -  p,+i  on  Mk. 
Then  lk  —  tb  -  /*  -  tak  or  (see  (D) 

(7)  Ik(a,b)  «  Ma.6)  +  *£  r4j  -  £  tas. 

i-l  i-2 

Due  to  (5) 

(8)  /,(n.6)-  max  £  r„  -  £  r>,,  0  . 

2«tl/<m  J-1 

While  Reddi  and  Ramamoorthy  [4]  based  their  approach  on  I\(a,  b )  using  (8),  Gupta  [1]  pre¬ 
ferred  Ik(a,b ),  It  ^  2,  as  defined  by  (7)  and  (8).  To  minimize  HP)*  they  solved  an 
equivalent  TSP  where  formula  (6)  for  distance  cost  matrices  C'  of  [4]  and  C  "  of  [  1  ]  reads** 


n  fn 


(60 

c0b  ™ 

r-\ 

m—  1 

cab  "  /](a,6),  cag  —  £  r„,  cafl  —  00 
1-2 

(6") 

c04  “  £ 

1-! 

c'ab  “  c'aO  -  0,  c'aa  -  °° 

'Gupta  minimized  /(/>)  -  T(P)  rf  +  £  /„(/>,,  pl+l)  -  HP)  -  Constant  while  Reddi-Ramamoorthy 

JL  «-l  ‘  m 

minimized  T(P)  -  £  i,  \  +  L.  l\^Pi'Pi*0  +  L,  <p  r 
(-1  '  1-1  1-2  " 

"Notice  that  C'  -  C  +  +  vj),  C"-  C  +  U"  +  vjj  where  C  is  defined  by  (6)  and 

ui  -  t,  irl.  u‘,  -  l<a<n,v;-0,  0  <  i  <  n, 

r— I 

"o“0vo-°.  £  W  1  <  a  <  n.  v;r-  |  V'OG 

i-i  i-i 
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4.  OTHER  OPTIMIZATION  CRITERIA 


Let  W^IP)  be  the  wailing  time  of  machine  Mk  (counted  from  r,  =  0)  that  processes 
sequence  P  Then 

-  ft  +  I  Mft.  /»,*>). 

i-t 

Gupta  [11  generalized  the  model  of  [4]  using  the  following  objective  function. 

(9)  /(/>)=£  wkWk{P). 

k-2 

Notice  that  for  wm  =  1,  and  wk  =  0  otherwise,  /( P)  and  T(P)  differ  by  a  constant  (see  first 
footnote  on  page  667). 


One  should  point  out  that  the  summation  in  (9)  must  include  term  w,W'j(/>)  since 
ffj (P)  is  nor  fixed,  contrary  to  what  is  claimed  (see  [1],  p.  237).  To  show  that  W'j(/>)  depends 
on  P consider  a  2-job,  2  machine  UFP  where  rn  =  1 .  r12  =  5,  h\  =  h,  / 22  =  3. 

Then  ^,(1,2)  =  0  while  IP, (21)  =  2. 


Another  objective  function  is 

m  m 

go)  /</»)- 2>*(/*-ff)- 

*-l  A-l 


X,  fP,k 
/- 1 


m  n-  1 

+  I  I  a+i) 

*-i  1- 1 


which  a  weighted  sum  of  time  involvement  of  all  machines.  The  first  term  of  (10)  being  con¬ 
stant  can  be  dropped  without  missing  optimality.  Each  of  these  problems  can  be  easily 
transformed  into  an  equivalent  TSP  (using  formulas  similar  to  (11)  of  [lj.  J.  M.  Van  Deman 
and  K.  R.  Baker  [5]  solved  the  USP  with  a  mean  flow  time  criterion.  Since  the  completion  time 

m 

of  job  Pj  on  Mm,  t"  +  tp ,  m,  is  equal  to  /,  +  £  tps  (see  ( 1 ) ) ,  the  objective  function  is 


f(P)  = 


t,  + 


V 


l  " 

=  —  T  t.  +  Constant. 

nk 


Hence,  one  can  instead  minimize  (see  also  (4)) 


(in  z(p)  =  1 0  =  1 1  dip,.  p,+l). 

y-t  )-\  i-t 


5.  CONCLUSION 


As  we  have  learned,  any  objective  function  of  the  UFP  can  be  expressed  in  terms  of 
either  d(a,  b),  !\(a,  b).  or  lk(a,  b ),  as  defined  by  (5),  (8),  and  (7).  Those  formulas  clearly 
indicate  that  d(a,  b)  requires  minimal  computational  effort,  while  l\(a,  b )  is  second  best. 

One  should  obviously  use  d(a,  b)  when  dealing  with  the  mean  flow-time  criterion  (see 

(0). 
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In  all  other  cases,  the  efficiency  of  an  approach  should  be  measured  in  terms  of  time 
necessary  to  calculate  the  travel  cost  matrix  of  the  respective  TSP  (see  [1],  p.  241).  If  the  com¬ 
pletion  time  T(P)  is  being  minimized,  then  according  to  (6),  (6’),  and  (6")  the  J(a,  b > 
approach  has  an  advantage  over  the  remaining  approaches  since  (5)  is  to  be  used  n(n  -  1) 
times.  The  efficiency  of  the  d(a.  b)  approach  diminishes,  however,  once  Wismer's  algorithm 

[6]  is  applied  to  determine  the  d(a,  b).  Gupta  [1]  has  shown  that  the  approach  based  on 
lm(a,  b)  is  more  efficient  than  Wismer's  method.* 

Suppose  the  objective  function  explicitly  depends  on  lk(a,  b )  as  in  Section  4.  Taking 
advantage  of  (1)  we  can  express  lk(a.  b)  =  t£  -  —  tak  as  follows: 

(7)  Ik(a,  b)  =  d(a.  b)  +  £  ihs  -  £  /„,. 

s-l  s-1 

Due  to  (5),  formula  (7')  is  computationally  more  efficient  than  1 7) . 
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'One  can  considerably  improve  the  efficiency  and  clarity  of  Wismer's  algorithm  by  reformulating  its  pivotal  steps  3  and 
4  as  follows. 

a  Find  £*  and  Ff*1  from  £*  -  F*'1  +  r*  1  and  Fj*1  -  F*  +  r*. 
b.  Is  £*  >  F**1  ?  If  not.  set  <  -  T**]  -  £‘  -  F*  -  F**1;  go  to  5. 

If  it  is,  set  d$y  “  0  and  F*  —  £*;  go  to  5. 

This  improvement  still  falls  behind  formula  (5)  in  terms  of  efficiency. 
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ABSTRACT 

This  paper  provides  a  method  for  solving  linear  fractional  interval  program¬ 
ming  problems  in  integers  with  the  help  of  a  branch  and  bound  technique. 


1.  INTRODUCTION 

This  paper  is  concerned  with  describing  a  systematic  procedure  for  solving  a  linear  frac¬ 
tional  interval  programming  problem  with  the  additional  condition  that  the  variables  are 
integers. 

The  problem  is  as  follows: 


(1) 


maximize  Z 


c'x  +  c0 


d'x  +  d0 
subject  to  b~  <  Ax  4  b 
x  has  integral  components. 


Cix) 
D(x)  ’ 


This  is  known  as  a  fractional  interval  integer  programming  problem  (FIIP).  A  linear  frac¬ 
tional  interval  programming  problem,  abbreviated  (FIP),  is  defined  as: 


(2) 


maximize 


c'x  +  C0 
d'x  +  d0 


C(x) 

D(x) 


subject  to  b  ^  Ax  <  b+. 


where  c',  c0,  d',  d0,  b  ,  b*  and  A  are  given. 


The  problem  was  introduced  in  [4]  and  solved  explicitly  in  the  feasible  bounded  case  with 
A  of  full  row  rank,  see  also  [2],  [6], 
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The  linear  fractional  programming  problem  in  all  generality  was  reduced  by  Charnes  and 
Cooper  (3]  to,  at  most,  a  pair  of  ordinary  linear  programming  problems,  by  adjoining  a  specified 
constraint  to  the  given  set  of  constraints. 

A  primal  algorithm  is  given  by  A.  Charnes,  D.  Granot  and  F.  Granot  [5]  for  solving  the 
(FIP)  problem  directly  and  this  utilizes  the  special  structure  of  the  interval  constraints. 

The  purpose  of  this  paper  is  to  obtain  an  integer  solution  to  the  (FIP)  problem  and  pro¬ 
duces,  after  a  finite  number  of  iterations,  an  optimal  integer  solution  to  (FIIP). 

2.  PRELIMINARY  RESULTS 


Consider  the  fractional  programming  problem  (FIP): 


maximize 
subject  to 


[  c'x  +  c0  _  CQc)  | 
\d'x  +  d0  00c)  }’ 

b~  <  Ax  <  b+. 


For  the  following,  let  b~,  b+  €  Rm\  c',  d\  x  €  R"\  c0,  d0  €  R  and  A  be  a  real  mxn 
matrix.  N{A )  denotes  the  null  space  of  A,  R(A)  the  range  space  of  A,  1  the  usual  orthogonal¬ 
ity  relation  in  R",  and  /?rmx"  denotes  the  set  of  all  mxn  matrices  of  rank  r. 


To  exclude  trivial  cases,  the  following  assumptions  are  made: 

ASSUMPTION  1:  (FIP)  is  feasible,  i.e., 

S  -  {x  €  R"\  b~  <  Ax  <  6+)  *  9. 


ASSUMPTION  2:  D(x):  -  d'x  +  d0  >  0  over  S,  the  feasible  region. 

ASSUMPTION  3:  There  exists  no  X  €  R  such  that 
D(x)  =*  AC(x):  —  A(c'x  +  eg)  for  all  x  €  S. 

ASSUMPTION  4:  Cl  N(A)  and  dlN(A),  since  a  feasible  (FIP)  is  unbounded  if  either 
c  €  JVC4),or  d  €  N(A)  [41. 

LEMMA  1:  Let  A  €  Rrmxn  and  D  €  R'x"  be  such  that 

(3)  R(D')  -  R(,A') 

then  AD‘  €  Rf",  i.e.,  AD'  is  of  full  column  rank  (91. 

LEMMA  2  [51:  Let  D  be  as  in  lemma  1  and  suppose  that  (FIP)  is  given  with  A  SRI**" 
and  clN(i4 ),  dl  N(A).  Then,  the  optimal  solution  of  (FIP)  is: 

(4)  D'y*  +  W(A)) 
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where  y*  is  any  optimal  solution  of 

.  .  c'D'y  +  c0 

(5)  maximize - 

d'D'y  +  do 

subject  to  b~  <  A  D[v  ^  b +. 

PROOF:  Since  R(A')  =  N  (AY  and  since  c  C  N(AYs  d  €  N(A  P,  it  follows  that  the 
optimal  solution  to  (FIP)  is  of  the  form 

(6)  x*  +  N(A). 

Where  x*  is  an  optimal  solution  to 

,,,  .  .  c'x  +  c0 

(7)  maximize  - , 

d'x  +  do 

subject  to  b~  ^  Ax  <  6+, 
x  €  R(A'). 

But,  since  R(A  0  =  R(D 0  and  x  €  R(A*)  can  be  equivalently  written  as: 
x  =  D'y,  y  €  Rr. 

Substituting  x  *  D'y  in  (7)  results  in  the  equivalent  problem,  which  completes  the  proof. 

Without  loss  of  generality  it  can  be  assumed,  that  there  exists  an  xfX  such  that 
D(x)  >  0. 

Following  Martos  (8],  every  feasible  point  x  to  (FIP)  is  classified  according  to  the  table 
below: 


X 

Type  1 

Type  2 

"good”  point 

D(x)  >  0 

Dix)  =  0,  C(x)  <  0 

"bad"  point 

D(x)  <  0 

D(x)  -  0,  C(x)  >  0 

"singular"  point 

D(x)  —  0  —  C(x) 

— 

A  complete  analysis  of  (FIP)  in  all  generality  and  generation  of  an  optimal  solution  to 
(FIP),  if  one  exists,  is  given  by  Charnes  and  others  [5j. 

3.  ALGORITHM 

Let  US  assume  that  (FIP),  given  in  (2),  is  feasible,  and  let  cl  N(A),  dl  N(A )  and  A  be 
of  full  column  rank  representation  (see  Lemma  2)  and  D(x)  >  0  on  S. 

From  Martos  [8],  we  recall: 

THEOREM  I:  If  5  ^  0  and  the  above  assumptions  are  satisfied,  then  the  fractional 
function  attains  a  finite  maximum  on  S,  which  is  taken  on  at  least  one  extreme  point. 
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Let  x*  be  a  feasible  extreme  point  of  5,  and  B  a  basis  for  the  rows  of  A  which  includes  all 
the  rows  of  the  linearly  independent  constraints  satisfied  as  equalities  at  x*,  and  M  be  the  com¬ 
pletion  of  B  to  A.  Let  bg ,  b bg ,  b £  be  the  partitions  of  the  vectors  b~  and  b+  which 
correspond  to  the  partition  of  A  to  B  and  N,  respectively. 

(8)  Let  y*  -  Bx\ 
and 

(9)  z, O')  =  (c'B~')i  ( d0  +  d'B-'y)  =  (</'«-»),  (c0  +  c‘B~ly) 

LEMMA  3:  x*  is  an  optimal  solution  to  (FIP)  if 
z,(y*)  <  0  whenever  y*  =  b~ 
z, O'*)  ^  0  whenever  y*=  b,+ 

An  algorithm  given  in  [5]  shows  that,  after  a  known  number  of  iterations,  an  extreme 
point  solution  is  obtained.  If  this  extreme  point  satisfies  the  optimality  criterion  (10),  then  the 
algorithm  terminates  with  an  optimal  solution  to  (FIP)  given  by  xopl  =  B~ly.  If,  however,  the 
optimality  criterion  is  not  satisfied,  the  algorithm  proceeds  along  adjacent  extreme  points  while 
improving  the  value  of  the  objective  function  until  an  optimal  extreme  point  to  (FIP)  is  pro¬ 
duced. 

We  solve  (2)  as  in  [5]  and  thus  we  obtain  the  maximum  value  of  the  objective  function  as 
A,  say.  If  all  the  components  of  x  are  also  all  integers,  then  this  is  obviously  the  required 
optimum  solution.  If  the  components  of  x  are  not  all  integers,  we  obtain  one  after  another 

more  restrictive  upper  bounds  Aj,  A2 . A,  in  the  same  way  as  Land  and  Doig  [7]  has 

done  for  linear  objective  functions. 

It  is  possible  to  transfer  the  problem  (1)  to  an  equivalent  linear  programming  problem 
and  then  to  apply  existing  methods  for  solving  the  equivalent  problem  in  integers.  However, 
since  such  a  transformation  will  increase  the  effective  size  of  the  problem  and  will  destroy  the 
special  structure  of  the  constraints,  we  apply  Land  and  Doig  method  to  problem  ( 1 )  to  obtain 
an  integer  solution.  It  maintains  the  original  two-sided  constraints  structure  and  takes  advan¬ 
tage  of  this  special  structure  during  the  solution  procedure  and  thus  will  be  more  efficient  for 
problems  which  occur  in  (IP)  form  rather  than  the  equivalent  (LP)  codes. 

Let  the  value  of  any  component  of  x,  say  xp,  be  a  (nonintegral)  for  the  maximum  value 
A  of  the  objective  function.  Now,  xp  is  forced  to  take  an  integral  value  and  hence  is  decreased 
to  at  least  a0  or  increased  to  at  least  a0  +  1,  where  a0  denotes  the  largest  integer  less  than  or 
equal  to  a . 

We  now  solve  the  fractional  interval  programming  problems  by  substituting  xp  -  a0,  and 
then  we  solve  it  by  substituting  xp  -  a0  +  1  by  the  same  method  [51.  Let  the  two  values  of  the 
objective  function  thus  obtained  be  A'  and  A ",  respectively.  However,  if  one  of  the  above  two 
problems,  say  the  fractional  interval  program  with  the  condition  xp  =  a0  is  not  feasible.  A'  does 
not  exist.  It  implies  that  the  value  of  xp  which  only  satisfies  xp  >  a0  +  1,  need  be  considered 
in  future  discussion.  If  neither  A'  nor  A  “exists,  xp  cannot  be  constrained  to  an  integral  value 
and  the  problem  possesses  no  feasible  solution. 
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We  now  find  A(  —  max(A\  A").  For  the  objective  function  having  A,  as  one  of  its 
values,  let  the  value  of  the  variable  xp  be  0  (integral),  and  also  suppose  all  the  other  com¬ 
ponents  of  x  are  still  not  integers.  To  find  the  second  best  solution,  determine  the  maximum 
values  of  the  fractional  interval  program  with  xp  -/3  -  1  and  then,  with  xp  =  /3  +  1 .  (One  of 
these  values  has  already  been  obtained  as  A'  or  A".)  Also,  as  Aj  is  not  the  required  solution,  a 
new  variable  (say,  xq)  is  chosen  from  those  that  are  not  integral  at  this  stage.  Let  xq  be  equal 
to  y  (nonintegral),  say.  Hence,  two  more  interval  programs  are  solved,  viz.,  the  fractional 
interval  program  with  xp  -  0,  xq  -  y0;  and  then,  with  xp  -  /3,  xq  —  y°  +  1.  Let  A2  be  the  max¬ 
imum  value  of  the  objective  function  amongst  the  values  just  obtained  and  those  that  are 
obtained  by  substituting  xp  -  0  -  1  and  xp  -  /3  +  1.  The  whole  argument  can  now  be  repeated 
with  A2  replacing  A!  as  the  current  upper  bound  on  the  optimal  value  of  the  objective  function. 

Continuing  the  above  process,  a  tree  is  formed  each  of  whose  vertices  represents  a  known 
set  of  integer  constraints  (for  example,  the  vertex  associated  with  A,  value  represents  xp  =  /3). 
A  branch  terminates  if  it  reaches  a  vertex  having  a  nonfeasible  solution.  Ultimately,  either  all 
branches  are  terminated  in  vertices  having  no  feasible  solutions,  or  else  a  vertex  having  the 
maximum  value  A*,  say,  is  reached  for  which  all  the  components  for  xare  integers.  This  must 
be  the  required  optimum  solution. 


4.  NUMERICAL  EXAMPLE 


For  the  sake  of  simplicity  and  easy  understanding  of  the  method,  we  shall  now  solve  an 
example. 


EXAMPLE: 


maximize 
subject  to 


3x|  +  20*2  +  4x3  +  4 
2x|  +  1  5x2  +  3xj  +  3  ’ 

8  <  3x!  +  x2  +  3x3  <  16, 

2  <  xj  +  3x2  <  9, 

-1  <  3x|  +  5x2  +  x3  <  11, 

3  <  2xj  +  4xj  <  10, 
xu  x2,  xj  are  integers. 


Thus,  we  have 


3  1  3 

8 

16 

A  - 

1  3  0 
3  5  1 

,  b~  - 

2 

-1 

,  b*  - 

9 

11 

0  2  4 

3 

10 
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The  problem  when  solved  by  primal  algorithm  method  [5]  gives 


x 


19 

i± 

19 

1_ 

19 


and  Z 


1 


n_ 

17  ’ 


but  it  does  not  satisfy  the  integrality  condition. 


We  determine  the  first  integral  values  of  min  x-3,  and  max  x3.  Here,  the  first  integer 
value  of  min  x3  is  1.  Hence,  we  maximize  the  given  objective  function,  subject  to  the  given 
constraints  by  substituting  x3  *  1 .  The  problem  becomes 

3x|  +  20x2  +  8 

maximize  Z  =  - - — - —7, 

2x,  +  15x2  +  6 

subject  to  5  ^  3x!  +  x2  <  13, 

2  <  x,  +  3x2  <  9, 

—2  <  3x|  +  Sx2  <  10, 

-1  <  2x2  <  6. 

This  problem  when  solved  by  [5]  gives  x,  =  3y,  x2  -  -y  and  Z  -  lyy. 

I  1  8 

The  first  integer  value  of  max  x3  is  2.  This  gives  x,  -  3y,  x2  “  -y  and  Z  =  1  -jy. 


Further,  min  x3  for  its  second  integer  value  is  x3  -  0.  With  this  additional  substitution, 
we  do  not  obtain  any  feasible  solution. 

The  maximum  value  of  Z  upto  this  point  is  lyy  with  x3  =  3y,  x2  -1  -y  and  x3  =  1. 
Thus,  A!  is  given  the  value  1~. 

We  now  maximize  the  given  objective  function  subject  to  the  given  constraints  with 
x3  —  1,  xj  -  3.  The  value  of  Z  comes  out  to  be  1—  and  x2  =  0.  This  gives  first  integer  solu¬ 
tion.  Further,  the  objective  function  subject  to  the  given  constraints  with  x3  =  1,  x,  -  4  does 
not  give  any  feasible  solution. 

Since  lyy  >  1-jy,  we  solve  the  given  problem  with  x3  =  2,  X|  -  3,  the  second  integer 
solution  is  obtained  i.e.,  x\  «  3,  x2  -  0,  x3  **  2  and  Z  •*  1-|. 

Further,  with  x3  -  2,  X|  -  4,  the  solution  is  infeasible. 

Thus,  the  required  optimal  integer  solution  is  Z  -  1-y,  Xj  -  3.  x2  -  0,  x3  -  1. 
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ABSTRACT 

Queueing  systems  which  include  the  possibility  for  a  customer  to  return  to 
the  same  server  for  additional  service  are  called  queueing  systems  with  feed¬ 
back.  Such  systems  occur  in  computer  networks  for  example.  In  these  systems 
a  chosen  customer  will  wait  in  the  queue,  be  serviced  and  then,  with  probability 
p,  return  to  wait  again,  be  serviced  again  and  continue  this  process  until,  with 
probability  (1  -  p)  -  q.  it  depart;  the  system  never  to  return.  The  time  of 
waiting  plus  service  time,  the  nth  time  the  customer  goes  through,  we  will  call 
his  nth  sojourn  time.  The  (random)  sum  of  these  sojourn  times  we  will  call  the 
total  sojourn  time  (abbreviated,  sojourn  time  when  there  is  no  confusion  which 
sojourn  time  we  are  talking  about).  In  this  paper  we  study  the  total  sojourn 
time  in  a  queueing  system  with  feedback.  We  give  the  details  for  M/G/l 
queues  in  which  the  decision  to  feedback  or  not  is  a  Bernoulli  process.  While 
the  details  of  the  computations  can  be  more  difficult,  the  structure  of  the  so¬ 
journ  time  process  is  unchanged  for  the  M/G/ 1  queue  with  a  more  general  de¬ 
cision  process  as  will  be  shown.  We  assume  the  reader  is  familiar  with  Disney, 
McNickle  and  Simon  [1]. 


1.  INTRODUCTION 

One  of  the  major,  largely  unsolved  problems  in  queueing  systems  with  feedback,  is  the 
sojourn  time  problem.  Upon  entry  to  the  system,  a  customer  spends  some  time  waiting  and 
being  served.  We  call  this  total  the  customer’s  sojourn  time  on  his  first  pass  through  the 
server.  At  the  conclusion  of  this  time,  the  customer  immediately  returns  for  more  service, 
with  probability  p ,  or  departs  never  to  return.  In  the  latter  case  the  customer’s  total  sojourn 
time  and  it’s  sojourn  time  on  the  first  pass  are  equal.  In  the  former  case  the  customer  immedi¬ 
ately  experiences  another  sojourn  time  on  it’s  second  pass.  Thus,  in  general,  the  total  sojourn 
time  is  a  (random)  sum  of  the  sojourn  times  on  each  pass  through  the  server.  However,  the 
number  of  passes  through  the  server  is  a  random  variable.  Under  the  assumption  that  the  deci¬ 
sion  to  feed  back  or  not  is  a  Bernoulli  random  variable,  the  number  of  passes  through  the 
server  is  then  a  geometrically  distributed  random  variable. 
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One  can  consider  many  such  sojourn  time  problems  depending  on  where  the  feedback 
unit  is  allowed  to  reenter  the  queue.  If,  for  example,  it  returns  to  the  head  of  the  queue,  one 
finds  trivially  that  the  total  sojourn  time  is  equivalent  to  the  sojourn  time  of  a  customer  in  a 
queue  without  feedback.  However,  if  the  feedback  reappears  anywhere  else  in  the  line  (e  g., 
the  feedback  goes  to  the  end  of  the  line  as  is  the  most  common  assumption),  determining  the 
total  sojourn  time  distribution  seems  to  be  a  formidable  task.  The  problem  is  caused  by  the 
fact  that  it  is  the  sum  of  the  sojourn  times  that  that  customer  spends  each  time  he  goes  through 
the  server  and,  these  times  are  not  independent  of  each  other.  We  will  study  this  total  sojourn 
time  problem. 

Takacs  14]  and  Montazer-Haghighi  [3]  are  the  only  papers  that  we  are  aware  of  that  study 
this  problem  in  detail.  Takacs’  results  are  for  M/G/I  queues  with  an  instantaneous  Bernoulli 
feedback  process,  as  are  most  of  ours  though  our  methods  are  different. 

Montazer-Haghighi  studies  the  M/M/m  queue  with  instantaneous,  Bernoulli  feedback 
with  particular  emphasis  on  m  =  2.  He  finds  a  transform  for  the  sojourn  time  distribution  in 
the  steady  state,  and  the  first  two  moments  of  this  distribution  as  well  as  the  steady  state  queue 
length  distribution. 

2.  THE  PROBLEM  AND  NOTATION 

We  are  concerned  with  sojourn  times  in  M/G/l  queues  with  instantaneous,  Bernoulli 
feedback.  We  assume  the  reader  is  familiar  with  Disney,  McNickle  and  Simon  [1], 

Choose  a  customer,  C.  We  will  follow  it  through  the  system.  Suppose  that  customer 
feeds  back  to  the  end  of  the  queue,  instantaneously  K  times  in  all.  K  is  a  geometric  distributed 
random  variable  on  {0. 1,2,  . .  !  with  parameter  p.  Each  time  the  customer  enters  the  queue, 
the  queue  discipline  is  first  in-first  out  (FIFO).  Then  define: 


2V0  =  the  number  of  customers  ahead  of  C  upon  its  initial  arrival. 

N„  =  the  number  of  customers  left  behind  the  nth  time  C  leaves  the  server  ( n  = 
1,2 . K). 

T„  =  the  time  at  which  Cleaves  the  server  for  the  nth  time  (n  »  1,2 . K). 


Y(Tn)  = 


1 ,  if  C  feeds  back  at  T„ 
0.  otherwise. 


( Kj  is  a  sequence  of  independent  identically  distributed  random  variables  with 
Pr[  Y„  —  1]  =  p,  Pr{  Yn  -  0]  -  1  -  p  =  q.  That  is  { Y„}  is  a  Bernoulli  process. 

X„  -  T„  -  r„_,  -  the  sojourn  time  of  C  on  its  nth  trip  through  the  system 
(n  «  2.3 . K ).  That  is,  Xn  is  the  sum  of  the  time  C  spends  waiting  for  ser¬ 

vice  plus  the  time  spent  in  service  the  nth  time  C  enters  the  queue. 


X\  -  T\  -  the  sojourn  time  of  C  on  it’s  first  trip  through  the  server. 

T'-Xl  +  X2  +  ...  +  XK 

B„(y )  —  number  of  exogenous  arrivals  to  the  queue  during  the  nth  sojourn  lime  (of 
length  y)  of  C(n  —  1,2,  ....  K).  B„(y)  is  a  Poisson  random  variable  (Ay)  for 
each  n. 
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Given  N„_ 

C„  -  the  number  of  customers  ahead  of  C  at  the  start  of  it’s  nth  pass  through  the  sys¬ 
tem  who  feedback.  C„  is  a  binomial  random  variable,  with  parameters  p, 

3.  RESULTS 

A  point  often  made  concerning  the  queue  length  process  for  the  M/G/l  queue  with  (Fj 
a  Bernoulli  process  is  that  it  is  equivalent  to  a  queue  without  feedback  in  the  sense  that  there 
always  exists  an  M/G/l  queue  without  feedback  whose  limiting  queue  length  distribution  is 
equal  to  that  for  the  M/G/l  queue  with  Bernoulli  feedback.  The  question  then  arises  as  to 
whether  there  is  an  M/G/I  queue  without  feedback  whose  limiting  sojourn  time  distribution  is 
equal  to  that  of  the  M/G/l  queue  with  Bernoulli  feedback.  Lemma  1  is  a  partial  answer  to  the 
question. 

LEMMA  1:  If  the  feedback  unit  goes  to  the  tail  of  the  line  and  if  the  M/G/l  queue 
without  feedback  and  the  M/G/l  queue  with  Bernoulli  feedback  are  to  have  the  same  first 
three  moments  for  their  respective  total  service  time  distributions,  then  there  is  no  M/G/l 
queue  without  feedback  whose  limiting  total  sojourn  time  distribution  is  equal  to  that  of  the 
M/G/l  queue  with  Bernoulli  feedback. 

PROOF:  Let  T*  be  the  sojourn  time  for  a  M/G/l  queue  without  feedback.  Let  a„  be 
the  nth  moment  about  0  of  the  service  time  (for  one  pass  through  the  server)  of  the  feedback 
queue.  Let  t„  be  the  nth  moment  about  0  for  the  total  service  time  in  the  queue  without  feed¬ 
back.  If  the  total  service  times  of  the  two  queues  are  to  have  the  same  first  three  moments 
then  it  is  a  simple  exercise  to  show 

<7  ' 

+  2(1  -  q)a? 

q  q 2 

«3  6(1  —  (?)«[« 2  _  6(1  —  q)2a? 

T +  P  +  P  ' 

Then  using  the  expected  sojourn  time  and  the  second  moment  of  the  sojourn  time  for  the 
queue  with  feedback  given  by  Takacs  (4]  and  the  well  known  results  (for  example,  see  Klein- 
rock  [21;  Section  5.7)  of  corresponding  moments  for  the  M/G/l  queue  without  feedback  one 
obtains 

E(T")  -  £(T0, 

E((T)2)  *  E{{rn2) 

unless  q  -  1  (i.e.,  there  is  no  feedback).  □ 

The  following  results  structure  the  problem  as  a  Markov  renewal  process  for  the  M/G/l 
queue  with  [  Y„ }  a  Bernoulli  process, 

THEOREM  1:  lN„,  Xn]  is  a  delayed  Markov  renewal  process  for  each  n  <  K. 


Ti  ” 
r2  - 

t3  = 
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PROOF:  For  K  =  k.  One  needs  only  note  that  when  C  first  enters  the  queue,  it  finds  N0 
customers  already  in  line  and,  for  any  FIFO,  M/G/l  queue  Xx  depends  only  on  N0.  If  C  does 
not  feedback,  X\  is  its  total  sojourn  time.  However,  if  C  feeds  back  it  will  encounter  all  of 
those  customers  to  arrive  during  X\  and,  additionally,  those  among  the  N0  who  feedback.  But, 
the  number  ahead  of  C  if  it  feeds  back,  depends  only  on  X t  and  N0.  Since  these  results  are 
true  for  any  n  >  1,  { V„,  Xn)  has  the  Markov  renewal  property.  In  the  M/G/l  case  this  pro¬ 
cess  is  a  delayed  Markov  renewal  process  since  Yj  will  not  in  general  have  the  distribution  of 
Xn  for  n  =  2,3 .  □ 

Let  Hc( x)  be  the  remaining  service  time  of  the  customer  in  service  (if  any)  at  the  arrival 
of  C.  Let  H'(x)  be  the  /-fold  convolution  of  H  with  itself.  Then  we  have 


COROLLARY  1:  The  transition  functions  for  the  {Nn,  X„\  process  are  given  by 


and 


Here 


Du(x)  -  P(Nt  =•  j.  X,  $  xlJV0  -  /) 

f*  H(dy)A0j(y),  if  /  -  0, 

fJ(Hc  ♦  H')  ( dy)A,J(y ),  if  /  -  1.2,  . . . 


Q»W  =  P(Nn  -  j.  Xn  ^  =  /) 

H(dy)A0j(y),  if  /  -  0, 

f*  ffl+'WAyW.  if  /  —  1,2, 


AfJ(y) 


min(i.y) 

I 

m—0 


pmq‘  m 


{ ±y)J-m  e-K> 

ml 


PROOF:  The  result  is  made  apparent  as  follows.  Given  i  and  C,  the  number  feeding 
back  is  simply  a  Bernoulli  random  variable  with  parameters  i,p.  For  a  given  C,  and  a  given 
sojourn  time  the  nth  time  C  goes  through,  the  number  of  new  arrivals  is  a  Poisson  random 
variable.  Furthermore,  given  C  and  /,  the  new  arrival  process  and  the  feedback  process  are 
independent.  Therefore,  A^iy)  is  simply  the  distribution  function  for  the  total  number  of 
inputs  to  the  queue  during  C ’s  nth  pass  through.  Given  there  are  i  ahead  of  him  on  the  pass, 
H,+HT)  or  HC(T)  is  then  the  distribution  function  for  the  sojourn  time  of  C  on  his  nth  pass. 
Then  since 


Pr IN„  -  j.  X„  <  x\ /V„_,  -  i]  -  Pr[*„  <  =  i]  Pr[/V„  -  j\X„  <  x.  \B_,  -  /) 

the  result  follows  easily.  □ 

Then  since  {N„,  X„]  is  a  Markov  renewal  process  we  have 
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COROLLARY  2: 

<?,!*'(>’)  =  P(Nk  =  7.  T(  .v |  iV0  =  ,)  - 

=  /;  £  Dim(dx)Q“-"  (y  -  X J. 

m-  I 


PROOF:  For  K  =  k  these  are  the  usual  A:  step  transition  functions  for  a  delayed  Markov 
renewal  process.  C 


If  X  <  qp  there  exists  a  vector  tr  satisfying 
ir  =  irQ(°°). 


THEOREM  2:  For  a  M/G/l  queue  with  instantaneous  Bernoulli  feedback,  in  the  steady 
state,  the  sojourn  time  for  C  is  given  by 


P(T'  <  v)  =  rr 


I  Qik'(y)pk''q 


At—  1 


U, 


where  U  is  the  column  vector  whose  elements  are  all  1. 


PROOF:  Conditioned  on  C  feeding  back  k  times  Corollary  2  gives  the  joint  distribution 
of  Nk  and  T{  conditioned  on  /V0.  The  result  then  follows  by  removing  the  condition  for  the 
number  of  feedbacks  and  the  N0  and  finding  the  resulting  marginal  distribution  for  Tf.  C 

Thus  the  marginal  sojourn  time  of  C  is  given  by  Theorem  2.  But  notice  that,  in  general, 
sojourn  times  of  successive  customers  are  not  independent  and  therefore  the  sojourn  time  pro¬ 
cess  is  not  specified  by  Theorem  2  alone.  Notice  also  that  the  Bernoulli  process  ( Yn\  plays  a 
minor  role.  The  results  can  be  generalized  at  least  to  the  case 

P(Y„~u\Y,.x  =  v.  Nn  /.  S„  =  y). 

Thus,  in  principle,  the  sojourn  time  problem  for  M/G/l  queues  with  feedback  is  solved  for  a 
rather  large  class  of  {  Yn]  processes. 

There  is  an  interesting  question  in  these  results.  We  know  from  Lemma  1  that  there  is 
no  M/G/l  queue  without  feedback  whose  sojourn  time  distribution  is  equal  to  the  M/G/l 
queue  with  Bernoulli  feedback  if  the  two  service  times  are  to  have  the  same  r*rst  three 
moments.  If  we  relax  this  moment  requirement,  the  problem  can  be  exposed  as  follows:  Is 
there  any  M/G/l  queue  without  feedback  whose  sojourn  time  distribution  for  a  given  customer 
is  equal  to  that  sojourn  time  for  a  given  M/G/l  queue  with  Bernoulli  feedback? 
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ABSTRACT 

We  present  probabilistic  proofs  for  the  following  two  facts. 

(i)  A  k  out  of  n  system  of  i.i.d  (independent  identically  distributed)  IFR 
(increasing  failure  rate)  components  has  an  IFR  life  distribution. 

(ii)  A  compound  Poisson  process  with  nonnegative  i.i.d  jumps  with  PF}  dis¬ 
tribution  is  IFR. 


INTRODUCTION 

A  nonnegative  random  variable  T  is  said  to  possess  an  IFR  distribution  if  for  any  a  >  0, 
P(T  >  t  +  a\T  >  t)  is  nonincreasing  in  t.  In  words,  given  {T  >  f),  T  is  stochastically 
decreasing  in  t.  This  notion  of  aging  is  intuitively  well  understood  and  plays  an  important  role 
in  the  theory  of  reliability.  In  this  report  we  provide  probabilistic  proofs  for  the  following 
known  facts: 

(i)  A  k  out  of  n  system  of  i.i.d.  IFR  components  has  an  IFR  life  distribution 

(ii)  A  compound  Poisson  process  with  nonnegative  i.i.d.  jumps  with  PF2  distribution 
is  IFR. 

These  theorems  have  been  proven  before,  (i)  in  12]  and  (ii)  in  [4],  In  both  cases  the  approach 
was  purely  analytical.  By  using  a  probabilistic  approach  we  are  able  to  provide  better  insight  and 
intuition. 

1.  k-OUT-OF-n  SYSTEM 

Consider  an— n  component  coherent  system  4>.  The  componentsMifelength  T},  T2 . 

T„  are  assumed  to  be  i.i.d.  random  variables  having  distribution  F(F  -  1  -  F)  and  density 
F'  -  /  (for  further  discussion  of  coherent  systems  consult  Barlow-Proschan  13]).  Let 
Xt(t),  i  -  1 . *  be  0  or  1  according  to  whether  (7’  ^  /]  or  (7)  >  (],  respectively,  and  let 

ft 

X(t)  -  (Af|(/),  ....  X„(t ))  be  the  state  vector  at  time  t.  Let  N,  -  JT  X,(i)  denote  the 

i-i 

number  of  functioning  components  at  t.  Then  for  an  arbitrary  coherent  structure  we  have: 
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LEMMA  1: 

For  any  k,  Ot  =*  0, 1 . n ) 

E(N,\N,  >  At,  <f>iXit))  -  \)  >  EiN,\<l>iXit))  -  1). 


PROOF: 

Check  that  for  y  =*  J . n 

PiN,  >  j\N,>  k.  fiiXit))  =  1)  >  P( N,  >  j\<f>iXit))  =  1). 


LEMMA  2: 

Fit)  £  hi ly.  £(/)) 

£(W,l0U(r))  =  1) - TTpTTT - 

hiFit)) 

where  /r  (*)  is  the  reliability  function  of  and 

A(ly.  F(r))  -  £(<*(*,(/) . *;_,(/).  l.JTy+I(r) . *„(r)). 

PROOF: 

£(1V,|  *(*(/)  -  1)«  £  £U,(/)UU(/))  =  1] 

/-I 

££<*,(/)  -  1.  <*(*('))  -  1) 

I- 1 _ 

Pi<f>iXit))  =  1) 

£m((r)  =  i.  ^(jr,(r) . jr,_,(f).i.jr,+l(/) . *„o))-  n 

i-i 


hiFit)) 


and  by  the  independent  assumption  this  equals 


%PiXtit)  «  1  )P(<f>(Xt(t) . X^it).\,X,,At) . X„it ))  -  I) 

_ 

hiFit)) 

Fit)  £  hi\„  Fit)) 

/-I _ 

hiFit)) 


LEMMA  3. 


h'ip)-  £/»<!„  p)~  ±hiOl>P). 

/-i  /-i 
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PROOF: 

See  Barlow  and  Proschan  [3], 

The  following  is  the  main  theo.em  of  this  section. 

THEOREM  1: 

Let  <t>  be  an  arbitrary  coherent  structure,  then  for  any  k  ik  -  0 . n): 

PiN,  >  k\*iXit))  —  1)  is  nonincreasing  in  t. 


Thus,  given  the  system  is  up  at  t,  the  number  of  functioning  components  of  stochastically 
decreasing  in  t. 

PROOF: 

PiN,  >  k  |<^CV(f))  —  1)  can  be  written  in  the  following  way: 

X  CjiFilW  lF(f)l"-' 

j-k _ 

HiFit)) 

where  Cj  is  the  number  of  distinct  </>-path  sets  of  size  j.  Thus, 

>  k  |0(Jf(/))  -  1) 

X  c,lF(/)]>[F(t)]^ 

d_  j-k _ 

=  dt  hiFit)) 


"T7¥7TT  P(N<  >  *■  *(*(,))  ~l)x 

h2(F(t))  Fit)Fit) 


iE[N, ]  -  E[N,  |  N,>  k,  MXit))  -  1]) 

+  fit)h'iFit))PiN,  >  k.  4>iXit))  -  1)1. 


This  is  nonpositive  for  t  >  0  iff 


By  Lemma  1, 


(*)  hffML  (£[ty]  -  E[N,  |  N,  >  k.  fiiXit))  -  II)  +  h'iFit))  0. 
Fit)  Fit) 


(*)  <  (£lN,l  -  £UV,U(Af(f))  -  ID  +  h'iFit)). 

Fit)Fit) 
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By  Lemmas  2,  3: 

hiFit )) 


F(t)F(t) 


Fit)  £  hi\„  Fit)) 


nFit)  - 


i- I 


hiFit)) 


+  £  A(l„  F(t))  -  £  /i(0„  Fit)) 

i-t  i-i 


1 


F(r) 

1 

Fit) 


nhiFit))  -  F(t)  £  hill.  Fit))  -  Fit)  £  MO,,  Fit)) 
/-i  /-7 

[nhiFit))  —  nhiFit ))]  =  0. 


Thus,  4  >  k\<hiXit))  =  1)  <  0. 

at 


Note  that  even  though  the  unconditional  process  N ,  is  decreasing  stochastically  (in  fact, 
decreasing  with  probability  1,  it  is  not  always  true  for  the  process  conditional  on  <t>iXit))  =  1. 
Indeed,  for  the  nonidentical  component  cast  this  is  not  true.  The  following  example  illustrates 
it.  Let  the  system  be 


Its  structure  function  is  <t>iX)  =  max  (Y,,  X2  *  X3)  and  suppose  the  lifetimes  are  independent 
and  distributed  uniforms  on  (0,  1)  and  (0,  2)  (0,  2),  respectively,  then  check 


PIN,  >  2|<M*(r))  =  1)  <  1  forO  <  t  <  1 


=  1  for  1  <  /  <  2. 


DEFINITION. 

n 

A  structure  <t>  is  called  k-out-of-n  iff  <f>  iX)  =  1  or  0  according  to  whether  £  X,  >  k  or 

i-i 

<  k,  respectively. 

COROLLARY  1. 

If  the  component  lifetime  is  IFR,  then  the  fr-out-of-n  system  has  an  IFR  distribution. 

PROOF:  Fix  k,  ik  -  1,2 . n) 

We  have  to  show  that  for  x  >  0 

P(N,+Jt  >  k  1 2V,  >  k)  is  decreasing  in  /. 
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Let 


Sir,  t,  x)  =  £ 

j-k 


r  \Bl±A.  1  \  _  F(t  +  x> 

j  Fit)  Fit) 


where  r  ^  k,  x  >  0  and  FO)  is  the  component  lifetime  distribution  function,  which  is 
assumed  to  be  IFR.  Thus  for  fixed  x  >  0,  S  ir,  t,  x)  is  increasing  in  r  and  decreasing  in  t  and 

PiNl+x  >  k  |  N,  >  k)  -  EiSiN,,  t,  x)  I  N,>  k). 

By  Theorem  1 ,  given  N,  ^  k,  N,  is  stochastically  decreasing  in  t,  which  implies 

EiSiN,,  t,  x)\N,  >  k)  is  decreasing  in  /. 


2.  IFR  SHOCK  MODEL 

Let  {A/,:  r  >  0}  be  a  homogeneous  Poisson  process  with  intensity  X  >  0  and  let 
X\,  X2,  ...  be  a  sequence  of  i.i.d.  nonnegative  random  variables  independent  of  [N, |  and  dis¬ 
tributed  according  to  F.  The  process 

T(/)  =  X,  +  ...  +  XN/  if  N,  >  0 
0  if  N,  =  0 

is  called  a  compound  Poisson  process.  Esary,  Marshall  and  Proschan  [4]  studied  this  process 
and  derived,  among  other  things,  conditions  which  make  the  process  IFR.  (A  nonnegative  pro¬ 
cess  Y  is  IFR  iff  for  any  a  >0,  Ta  =»  inf  {/ :  Yt  >  a)  has  an  IFR  distribution.)  Their  approach 
is  purely  analytical  and  they  derive  the  conditions  using  total  positivity  theory.  We  start  with 
the  following  theorem: 

THEOREM  3: 

For  any  k  ^  0,  c  ^  0: 

PiN,  >  k  I  Yit)  <  c)  is  increasing  t. 


PROOF: 


£  F'Jic)  i\t)J/j\ 

PiN,  >  k  |  Yit)  <  c)  -  ^ - 

£  F*Jic)  ikt)J/j\ 
J- o 


where  F*J  denotes  the  fih  fold  convolution  of  F.  The  proof  is  by  induction  on  k. 
For  k  »  1 ,  check 


<L 

dt 


£  F*>ic)  i\t)J/j\ 
_ 

1  +  £  F*J(c)  iktV/j'. 

7-1 


^  0. 
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Now  assume  it  holds  for  k  <  n,  and  to  show  that  it  holds  for  k 
T  F*j(c)  (kt)J/j\  T  T 


£  F*'(c)(Xr)Vy!  £ 

j-o  >- o 

By  the  induction  hypothesis  it  suffices  to  show 


9  _/-«+! 


3  U,  9r  ..Ij 


It  is  easy  to  check  that  the  numerator  is  nonnegative  iff: 


i  [t,  ,  i,  (x^!|  -  <^'!)  > 0 

which  is  true  since  it  is  equal  to 

[F*("+i)(Xf)"+1/nn!  +  F*<"+J)  (Xr)"+2/n(«  +  1)!  +...) 

-  [F*("+1)(Xr)"+1/(/J  +  D!  +  F*{"+2)(kt)n+2/(n  +  2)!  +  ...]>  0. 

COROLLARY  2: 

Theorem  3  holds  even  when  Y(t)  is  nonhomogeneous  compound  Poisson.  We  also  need 
the  following  well  known  lemma. 

LEMMA  4. 

If  the  distribution  function  Fis  PF2,  i.e.,  if  for  any  x,  <  x2,  y,  <  y2: 

F(x,  -  y,)  F(x,  -  y2) 

F(x2  —  y ,)  F(x2  -  y2)  >0 

then  for  any  x,  <  x2  and  n  >  0 

F*"(xi)F*("+1)  (x2)  >  F*"(x2)F*("+i>  (x,) - 

PROOF: 

See  Esary-Marshall-Proschan  [4],  Theorem  4.9.  The  above  states  that  if  a  sequence  of 
i.i.d.  random  variables  X\,X2,  ...  have  a  PF2  distribution  F,  then  for  any  X|  <  x2: 
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P(X |  +  . . .  +  X„  <  *i  |  Xx  +  . . .  +  X„  <  x2)  is  decreasing  in  n. 


COROLLARY  3: 

If  F  is  PF2,  then  for  X\  <  x2 

PiYit)  <  xi  I  K(r)  <  x2)  is  decreasing  in  /. 


PROOF: 

PiYit)  <  x,l  Y(t)  ^  x2) 

=  £  PiYit)  <  x,  I  Y(t)  <  x2,  N,  =  n)P(N,  =  n  I  K(r)  <  x2) 

*-0 

«  £  PiXx  +  ...  +  X„  <  x,  I Xx  +  ...  +  X„  <  x2)P(N,  =  n  |  F(/)  <  x2) 

«-o 

given  Y(t)  <  x2,  N,  is  stochastically  increasing  in  t  and  PiXx  +  . . .  +  X„  <  xt  I  Xt  +  . . .  + 
Yn  <  x2)  is  decreasing  in  n  and  hence 

£  PUT,  +  . . .  +  Xn  <  X!  |  Xx  +  . . .  +  Xn  <  x2)PiN,  -  «  I  K(r)  <  x2) 

n-0 

is  decreasing  in  t. 

COROLLARY  4: 

If  Fis  PF7  then  the  compound  Poisson  process  Y  is  IFR. 

COROLLARY  5: 

Suppose  Y  is  nonhomogeneous  compound  Poisson  with  intensity  A(-),  then  if  Fis  PF2 
and  A  is  increasing,  Y  is  IFR  process. 

PROOF. 

Have  to  show  that  for  x  >  0,  c  >  0, 

PiYit  +  x)  <  c  I  Yit)  <  c)  is  decreasing  in  t. 

~  [r'+xA(u)d«]' 

Fix  x  >  0  and  let  hit,  y)  -  T  — - : - —  F*Jic  -  .y)  for  0  <  y  <  c.  Now  since  A  is 

j-o  J- 

increasing  in  r,  hit,  y)  is  decreasing  in  t and  y.  But 

P(Y(t  +  x)  <  cl  Yit)  <  c)-£h((f,K(r))|  Y(t)  ^  c) 

which  is  decreasing  in  t.  A  similar  result  concerning  the  nonhomogeneous  compound  Poisson 
was  obtained  by  Abdel-Hameed-Proschan  [1]  using  similar  argument  as  in  Esary-Marshall- 
Proschan  14], 
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ABSTRACT 

Consider  two  machines,  labeled  I  and  2.  A  set  of  tasks  has  to  be  processed 
first  on  machine  I  and  after  that  on  machine  2.  A  second  set  of  tasks  has  to  be 
processed  first  on  machine  2  and  after  that  on  machine  I.  All  the  processing 
times  are  exponentially  distributed.  We  present  a  policy  which  minimizes  the 
expected  completion  time  of  all  tasks. 


We  consider  two  machines,  labeled  1  and  2.  There  are  n  tasks  which  have  to  be  pro¬ 
cessed  first  on  machine  1  and  after  that  on  machine  2.  This  set  of  tasks  will  be  called  set  A. 
Moreover,  there  are  m  tasks  which  have  to  be  processed  first  on  machine  2  and  after  that  on 
machine  1.  These  tasks  will  be  called  set  B.  The  processing  time  of  task  /,  i  €  A  UB,  on 
machine  1  (2)  is  a  random  variable  exponential!}'  distributed  with  rate  X,(u,).  Any  of  these 
rates  may  be  infinite;  that  means  that  the  corresponding  processing  time  is  zero.  This  model  is 
usually  called  a  Job  Shop. 

We  are  interested  in  a  policy  which  minimizes  the  expected  completion  time  of  all 
tasks— the  so-called  makespan.  A  class  of  policies  will  be  considered  in  which  the  decision¬ 
maker,  at  any  time  when  a  machine  is  freed,  is  allowed  to  review  his  policy  and  let  his  decision 
depend  on  the  past  history  of  the  process. 

The  version  of  the  above  problem  where  the  processing  times  of  the  tasks  are  determinis¬ 
tic  has  been  treated  by  Jackson  (2],  He  presented  a  polynomial  time  algorithm  to  find  the 
optimal  schedule. 

The  special  case  where  set  B  is  empty  is  usually  called  a  Flow  Shop.  So  a  Flow  Shop  is  a 
Job  Shop  in  which  all  tasks  are  required  to  pass  through  the  successive  machines  in  the  same 
order.  Bagga  (1 1  treated  the  two-machine  Flow  Shop  with  exponential  processing  times,  i.e., 
the  Job  Shop  where  set  B  is  empty.  We  will  give  here  a  short  description  of  his  results  as  it  will 
play  a  role  in  the  proof  of  the  main  theorem  in  this  note.  As  the  order  in  which  the  tasks  are 
processed  on  the  second  machine  does  not  affect  the  makespan  in  the  Flow  Shop  model,  the 
sequence  in  which  the  tasks  are  processed  on  the  two  machines  can  be  assumed  to  be  the  same. 

So  the  Row  Shop  model  is  basically  a  sequencing  problem.  Let  j, . j„.  a  permutation  of 

1 . n,  denote  the  sequence  in  which  the  tasks  go  through  the  machines,  i.e.,  at  time  t  -  0 

task  it  starts  being  processed  on  machine  1;  after  it  finishes  its  processing  there,  it  starts  on 
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machine  2,  while  task  y2  starts  on  machine  1,  etc.  One  would  like  to  know  the  sequence  which 

minimizes  the  expected  makespan.  Let  E(F(J\ . .)„))  denote  the  expected  rnakespan  when 

using  sequence  j\ . jn. 

THEOREM  I  (Bagga): 

(i)  E(F(Jl . j,-  |.  ji.  >,+  i.  j,  +  2 . in ))  < 

E(fu i . j,- y, + 1 ,  y,+2 . 7„)) 

when 

<*,,  -M,,)  >  (X/  t| 

(ii)  Processing  the  tasks  in  decreasing  order  of X,  —  m,  minimizes  the  expected 
makespan. 

It  is  clear  that  (ii)  is  an  immediate  consequence  of  (i),  but  it  is  part  (i)  that  we  will  need 
in  the  proof  of  Theorem  2.  Theorem  1  tells  us  also  that  when  the  processing  time  of  task  /  on 
machine  1  (2)  is  zero,  i.e.,  X,  =  °°  (/x,  »  °°),  it  has  to  go  first  (last).  If  there  are  a  number  of 
tasks  with  zero  processing  times  on  machine  1,  all  these  tasks  have  to  precede  all  the  others. 
The  sequence  in  which  these  tasks  go  through  machine  2  does  not  affect  the  makespan.  A 
similar  remark  can  be  made  if  there  is  more  than  one  task  with  zero  processing  time  on 
machine  2. 

The  Job  Shop  model  under  consideration  in  this  note  can  be  viewed  as  a  Markov  Decision 
Process  in  continuous  time.  Define  the  decision  moments  to  be  the  time  epochs  that  a 
machine  is  freed.  A  policy  instructs  the  decision  maker  to  take  a  certain  action  at  a  decision 
moment  (to  start  processing  a  certain  task  on  the  machine  just  freed)  depending  upon  the  state 
of  the  system  at  that  moment.  The  state  of  the  system  at  a  decision  moment  is  determined  by 
the  tasks  which  have  been  completed  up  to  that  moment  and  the  task  which  is  still  being  pro¬ 
cessed  on  the  busy  machine.  In  Theorem  2  we  will  determine  the  optimal  policy,  i.e.,  the  pol¬ 
icy  which  minimizes  the  expected  makespan.  We  will  use  the  following  terminology:  A  "sin¬ 
gle"  task  denotes  a  task  which  only  has  to  be  processed  on  one  of  the  machines,  so  we  assume 
that  a  single  task  of  set  A  (B)  remains  only  to  be  processed  on  machine  2(1).  A  "double"  task 
denotes  a  task  which  still  has  to  be  processed  on  both  machines  in  the  prescribed  order. 
Observe  that  a  single  task  of  set  B  can  be  considered  as  a  double  task  of  set  A  with  zero  pro¬ 
cessing  time  on  machine  2. 

THEOREM  2: 

The  optimal  policy  instructs  the  decision-maker,  whenever  machine  1  (2)  is  freed,  to  start 
processing  of  the  remaining  double  tasks  of  set  A  (B)  the  one  with  the  highest  value  of  X  —  n 
(fi  —  x ).  If  no  double  tasks  of  set  A  (B)  remain,  the  decision-maker  may  start  any  one  of  the 
single  tasks. 

PROOF: 

The  proof  consists  of  two  parts.  In  the  first  part  we  compare  two  policies,  which  we  will 
call  ir  |  and  ir2.  Suppose  machine  1  is  freed  at  time  i.  Both  n  t  and  n  2  will  schedule  the  remain¬ 
ing  double  tasks  of  set  B  and  the  single  tasks  of  set  A,  which  finished  their  processing  on 
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machine  i  before  time  r,  in  the  same  sequence  on  machine  2;  under  77 ,  and  1 r2  machine  2  has 
to  process  these  tasks  before  it  processes  those  tasks  of  set  A  which  finish  on  machine  1  after 
time  1.  On  machine  1  policy  7r,  will  schedule  at  time  1  first  task  1  (with  parameters  X  ,  andM  ,) 
followed  by  task  2  (with  parameters  a  2  andM  2^  Any  of  these  parameters  may  be  infinite.  Pol¬ 
icy  ir  2  will  schedule  first  task  2  and  then  task  1.  After  finishing  these  two  tasks,  policies  n ,  and 
it  2  will  schedule  the  remaining  tasks  in  the  same  sequence  on  machine  1.  These  remaining 
tasks  include  double  tasks  as  well  as  single  tasks,  including  the  single  tasks  of  set  B  which  have 

finished  their  processing  on  machine  2  after  time  1.  Call  this  sequence  of  tasks  y, . jk.  Let 

x  denote  the  time  machine  2  still  needs  to  finish  the  remaining  double  tasks  of  set  B  and  the 

single  tasks  of  set  A  which  have  finished  their  processing  on  machine  1  before  time  t.  Let 

A'1(^2)  denote  the  random  processing  time  of  task  1  (2)  on  machine  1.  In  case  Xx  +  X2  ^  x 

the  makespans  under  and  tt2  are  clearly  equal.  So  we  only  have  to  compare  w,  and  tt2  in 
case  A',  +  Jfj  >  x  We  will  make  a  distinction  between  two  cases: 

(i)  Suppose  X\>  x  and  we  are  using  ir At  time  1  +  x  the  problem  reduces  to  a 
Flow  Shop  where  the  sequence  used  is  1,  2,  y,  ....  jk. 

(ii)  Suppose  X\  <  x  and  Xx  +  X2  >  x.  The  problem  reduces  at  time  /  +  x  to  the 
same  Flow  Shop  with  the  difference  that  task  1  has  finished  its  processing  on 
machine  1  already. 


Now  let  EU\)  (£(72))  denote  the  expected  remaining  time  to  finish  all  tasks  on  both  machines 
under  policy  *r,(jr2). 

E(J\\X\  +  X2  >  x)  ■  m,  +  x2  >  x)  = 
e~V  [x  +  E(F(  1,  2,  jx . A»1  + 


-X  ,(*-  v) 

e  2 


dy  • 


x  +  £(£(1.2.  j\ . A»"  f" 

K  1 


and 


£(y2|A,  +  x2  >  x)  •  m,  +  x2  >  x)  - 
(x  +  £(£(2.  1.  j, . A»]  + 


/  2  e  ^ 

J  y- 0  1 


-XjV  -Ajlx-y) 


dy 


x  +  £(£(2.  l.y, . h))~  T~ 


Clearly, 


e'X|Jt  +  f  X,  e"*1'  dy  =  +  f  X2  dv 

J  y- 0  1  J  y-0  2 

as  both  the  l.h.s.  and  the  r.h.s.  are  equal  to  £(*,  +  X2>  x).  And  also 


-*,*  -x  ,(*->) 
e  e 


dy\ 


A 1 


dy\ 


a  2 


as  can  be  checked  easily. 
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So 

EUi\Xi  +  X2  >  x)  -  EU2\X{  +  X2  >  x)  - 

£(F(1.  2,  Ji . jk))~  E(F( 2,  1,  y, . jk )) 

which  according  to  Theorem  1  is  positive  when  —  mi  <  X 2  —  M2-  This  completes  the  first 
part  of  the  proof. 

In  the  second  part  of  the  proof  we  use  the  result  of  the  first  part  to  show  the  theorem.  It 
is  a  well-known  fact  in  the  theory  of  Markov  Decision  Processes  that  a  policy  tt  *  is  optimal,  if 
when  using  n  *  from  any  decision  moment  t  and  state  onwards  it  results  in  a  smaller  expected 
makespan  than  acting  at  t  not  according  to  tt  *  but  from  the  next  decision  moment  onwards 
according  to  tt  * 

Let  tr  *  denote  the  policy  described  in  the  theorem  and  let  tt'  denote  a  policy  which  acts 
differently  at  t  and  acts  according  to  7r  •  from  the  next  decision  moment  onwards.  It  is  clear 
that  the  sequence  resulting  under  n'  can  be  transformed  into  the  sequence  under  n  *  through  a 
number  of  adjacent  pairwise  switches,  involving  the  tasks  scheduled  on  the  machine  which  was 
freed  at  time  t.  Each  pairwise  switch  will  cause  a  decrease  in  the  expected  makespan  as  was 
shown  in  the  first  part  of  the  proof. 

This  completes  the  proof  of  the  theorem. 
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